Overview

Brought to you by YData

Dataset statistics

Number of variables75
Number of observations18866
Missing cells485211
Missing cells (%)34.3%
Total size in memory10.8 MiB
Average record size in memory600.0 B

Variable types

Text75

Dataset

DescriptionVertebrate Zoology Division - Mammalogy, Yale Peabody Museum 0061684-241126133413365
URLhttps://doi.org/10.15468/dl.shrths

Alerts

accessRights has constant value "Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj" Constant
license has constant value "http://creativecommons.org/publicdomain/zero/1.0/" Constant
rightsHolder has constant value "Yale Peabody Museum" Constant
institutionCode has constant value "YPM" Constant
collectionCode has constant value "VZ" Constant
ownerInstitutionCode has constant value "YPM" Constant
basisOfRecord has constant value "PreservedSpecimen" Constant
dataGeneralizations has constant value "Coordinate data unavailable" Constant
kingdom has constant value "Animalia" Constant
phylum has constant value "Chordata" Constant
class has constant value "Mammalia" Constant
nomenclaturalCode has constant value "ICZN" Constant
taxonRemarks has constant value "Animals and Plants: Vertebrates - Mammals" Constant
dataGeneralizations has 18800 (99.7%) missing values Missing
recordedBy has 4296 (22.8%) missing values Missing
sex has 10118 (53.6%) missing values Missing
lifeStage has 17900 (94.9%) missing values Missing
reproductiveCondition has 16576 (87.9%) missing values Missing
behavior has 18864 (> 99.9%) missing values Missing
preparations has 349 (1.8%) missing values Missing
associatedMedia has 18411 (97.6%) missing values Missing
associatedReferences has 12450 (66.0%) missing values Missing
associatedTaxa has 18487 (98.0%) missing values Missing
otherCatalogNumbers has 12652 (67.1%) missing values Missing
fieldNumber has 11555 (61.2%) missing values Missing
eventDate has 6221 (33.0%) missing values Missing
year has 6267 (33.2%) missing values Missing
month has 7343 (38.9%) missing values Missing
day has 7899 (41.9%) missing values Missing
habitat has 18739 (99.3%) missing values Missing
higherGeography has 3778 (20.0%) missing values Missing
continent has 3913 (20.7%) missing values Missing
waterBody has 18739 (99.3%) missing values Missing
country has 3927 (20.8%) missing values Missing
stateProvince has 5347 (28.3%) missing values Missing
county has 9192 (48.7%) missing values Missing
municipality has 18309 (97.0%) missing values Missing
locality has 5869 (31.1%) missing values Missing
minimumElevationInMeters has 17391 (92.2%) missing values Missing
maximumElevationInMeters has 18082 (95.8%) missing values Missing
verbatimElevation has 17391 (92.2%) missing values Missing
decimalLatitude has 5543 (29.4%) missing values Missing
decimalLongitude has 5543 (29.4%) missing values Missing
geodeticDatum has 5666 (30.0%) missing values Missing
coordinateUncertaintyInMeters has 5609 (29.7%) missing values Missing
georeferencedBy has 18537 (98.3%) missing values Missing
georeferencedDate has 10549 (55.9%) missing values Missing
georeferenceProtocol has 5610 (29.7%) missing values Missing
georeferenceSources has 5615 (29.8%) missing values Missing
georeferenceRemarks has 5661 (30.0%) missing values Missing
typeStatus has 18844 (99.9%) missing values Missing
identifiedBy has 17735 (94.0%) missing values Missing
dateIdentified has 17913 (94.9%) missing values Missing
identificationRemarks has 18863 (> 99.9%) missing values Missing
order has 401 (2.1%) missing values Missing
family has 838 (4.4%) missing values Missing
genus has 1196 (6.3%) missing values Missing
specificEpithet has 2296 (12.2%) missing values Missing
infraspecificEpithet has 8470 (44.9%) missing values Missing
scientificNameAuthorship has 385 (2.0%) missing values Missing
gbifID has unique values Unique
bibliographicCitation has unique values Unique
references has unique values Unique
dynamicProperties has unique values Unique
occurrenceID has unique values Unique
catalogNumber has unique values Unique

Reproduction

Analysis started2025-01-14 16:27:27.164590
Analysis finished2025-01-14 16:27:28.862490
Duration1.7 second
Software versionydata-profiling vv4.12.1
Download configurationconfig.json

Variables

gbifID
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:29.032895image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length10
Min length10

Characters and Unicode

Total characters188660
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st row4953409301
2nd row4911830319
3rd row4911830318
4th row4911830317
5th row4911830316
ValueCountFrequency (%)
4953409301 1
 
< 0.1%
4599382340 1
 
< 0.1%
4911830315 1
 
< 0.1%
4911830314 1
 
< 0.1%
4911830313 1
 
< 0.1%
4911830312 1
 
< 0.1%
4911830311 1
 
< 0.1%
4911830310 1
 
< 0.1%
4911830309 1
 
< 0.1%
4911830308 1
 
< 0.1%
Other values (18856) 18856
99.9%
2025-01-14T11:27:29.307145image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 188660
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

Most occurring scripts

ValueCountFrequency (%)
Common 188660
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 188660
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 30292
16.1%
3 27042
14.3%
5 25137
13.3%
9 22536
11.9%
0 22490
11.9%
2 21472
11.4%
4 11335
 
6.0%
7 10804
 
5.7%
8 8933
 
4.7%
6 8619
 
4.6%

accessRights
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:29.389889image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length129
Median length129
Mean length129
Min length129

Characters and Unicode

Total characters2433714
Distinct characters38
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
2nd rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
3rd rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
4th rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
5th rowOpen Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj
ValueCountFrequency (%)
open 18866
11.1%
access 18866
11.1%
http://creativecommons.org/publicdomain/zero/1.0 18866
11.1%
see 18866
11.1%
yale 18866
11.1%
peabody 18866
11.1%
policies 18866
11.1%
at 18866
11.1%
http://hdl.handle.net/10079/8931zqj 18866
11.1%
2025-01-14T11:27:29.506367image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 226392
 
9.3%
/ 188660
 
7.8%
150928
 
6.2%
t 132062
 
5.4%
o 132062
 
5.4%
a 113196
 
4.7%
c 113196
 
4.7%
i 94330
 
3.9%
n 94330
 
3.9%
s 94330
 
3.9%
Other values (28) 1094228
45.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1641342
67.4%
Other Punctuation 358454
 
14.7%
Decimal Number 207526
 
8.5%
Space Separator 150928
 
6.2%
Uppercase Letter 75464
 
3.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 226392
13.8%
t 132062
 
8.0%
o 132062
 
8.0%
a 113196
 
6.9%
c 113196
 
6.9%
i 94330
 
5.7%
n 94330
 
5.7%
s 94330
 
5.7%
l 94330
 
5.7%
p 94330
 
5.7%
Other values (12) 452784
27.6%
Decimal Number
ValueCountFrequency (%)
1 56598
27.3%
0 56598
27.3%
9 37732
18.2%
8 18866
 
9.1%
7 18866
 
9.1%
3 18866
 
9.1%
Other Punctuation
ValueCountFrequency (%)
/ 188660
52.6%
. 75464
 
21.1%
: 56598
 
15.8%
; 18866
 
5.3%
, 18866
 
5.3%
Uppercase Letter
ValueCountFrequency (%)
P 18866
25.0%
O 18866
25.0%
Y 18866
25.0%
A 18866
25.0%
Space Separator
ValueCountFrequency (%)
150928
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1716806
70.5%
Common 716908
29.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 226392
13.2%
t 132062
 
7.7%
o 132062
 
7.7%
a 113196
 
6.6%
c 113196
 
6.6%
i 94330
 
5.5%
n 94330
 
5.5%
s 94330
 
5.5%
l 94330
 
5.5%
p 94330
 
5.5%
Other values (16) 528248
30.8%
Common
ValueCountFrequency (%)
/ 188660
26.3%
150928
21.1%
. 75464
 
10.5%
: 56598
 
7.9%
1 56598
 
7.9%
0 56598
 
7.9%
9 37732
 
5.3%
8 18866
 
2.6%
7 18866
 
2.6%
3 18866
 
2.6%
Other values (2) 37732
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2433714
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 226392
 
9.3%
/ 188660
 
7.8%
150928
 
6.2%
t 132062
 
5.4%
o 132062
 
5.4%
a 113196
 
4.7%
c 113196
 
4.7%
i 94330
 
3.9%
n 94330
 
3.9%
s 94330
 
3.9%
Other values (28) 1094228
45.0%
Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:29.696193image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length62
Median length50
Mean length40.04675077
Min length20

Characters and Unicode

Total characters755522
Distinct characters66
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowTamias striatus fisheri (YPM MAM 017903)
2nd rowPeromyscus leucopus noveboracensis (YPM MAM 017889)
3rd rowPeromyscus leucopus noveboracensis (YPM MAM 017897)
4th rowPeromyscus leucopus noveboracensis (YPM MAM 017895)
5th rowPeromyscus leucopus noveboracensis (YPM MAM 017888)
ValueCountFrequency (%)
ypm 18866
 
18.4%
mam 18866
 
18.4%
peromyscus 1837
 
1.8%
cinereus 1489
 
1.5%
sorex 1193
 
1.2%
brevicauda 1125
 
1.1%
blarina 976
 
1.0%
zibethicus 898
 
0.9%
talpoides 868
 
0.8%
gapperi 848
 
0.8%
Other values (20938) 55590
54.2%
2025-01-14T11:27:29.966552image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
83690
 
11.1%
M 58523
 
7.7%
0 44332
 
5.9%
s 41623
 
5.5%
i 36625
 
4.8%
a 35093
 
4.6%
u 30890
 
4.1%
e 30381
 
4.0%
r 26522
 
3.5%
o 25267
 
3.3%
Other values (56) 342576
45.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 370969
49.1%
Uppercase Letter 131912
 
17.5%
Decimal Number 126705
 
16.8%
Space Separator 83690
 
11.1%
Close Punctuation 18866
 
2.5%
Open Punctuation 18866
 
2.5%
Other Punctuation 4512
 
0.6%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 41623
11.2%
i 36625
9.9%
a 35093
9.5%
u 30890
 
8.3%
e 30381
 
8.2%
r 26522
 
7.1%
o 25267
 
6.8%
n 22452
 
6.1%
c 20781
 
5.6%
l 16432
 
4.4%
Other values (16) 84903
22.9%
Uppercase Letter
ValueCountFrequency (%)
M 58523
44.4%
P 21973
 
16.7%
A 19464
 
14.8%
Y 18866
 
14.3%
C 2505
 
1.9%
S 1952
 
1.5%
B 1452
 
1.1%
O 1312
 
1.0%
T 1217
 
0.9%
N 831
 
0.6%
Other values (14) 3817
 
2.9%
Decimal Number
ValueCountFrequency (%)
0 44332
35.0%
1 20061
15.8%
2 9843
 
7.8%
6 8199
 
6.5%
7 8066
 
6.4%
5 8017
 
6.3%
4 7780
 
6.1%
3 7635
 
6.0%
9 6550
 
5.2%
8 6222
 
4.9%
Other Punctuation
ValueCountFrequency (%)
. 4510
> 99.9%
? 2
 
< 0.1%
Space Separator
ValueCountFrequency (%)
83690
100.0%
Close Punctuation
ValueCountFrequency (%)
) 18866
100.0%
Open Punctuation
ValueCountFrequency (%)
( 18866
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 502881
66.6%
Common 252641
33.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 58523
 
11.6%
s 41623
 
8.3%
i 36625
 
7.3%
a 35093
 
7.0%
u 30890
 
6.1%
e 30381
 
6.0%
r 26522
 
5.3%
o 25267
 
5.0%
n 22452
 
4.5%
P 21973
 
4.4%
Other values (40) 173532
34.5%
Common
ValueCountFrequency (%)
83690
33.1%
0 44332
17.5%
1 20061
 
7.9%
) 18866
 
7.5%
( 18866
 
7.5%
2 9843
 
3.9%
6 8199
 
3.2%
7 8066
 
3.2%
5 8017
 
3.2%
4 7780
 
3.1%
Other values (6) 24921
 
9.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 755522
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
83690
 
11.1%
M 58523
 
7.7%
0 44332
 
5.9%
s 41623
 
5.5%
i 36625
 
4.8%
a 35093
 
4.6%
u 30890
 
4.1%
e 30381
 
4.0%
r 26522
 
3.5%
o 25267
 
3.3%
Other values (56) 342576
45.3%

license
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:30.039110image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length49
Median length49
Mean length49
Min length49

Characters and Unicode

Total characters924434
Distinct characters24
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowhttp://creativecommons.org/publicdomain/zero/1.0/
2nd rowhttp://creativecommons.org/publicdomain/zero/1.0/
3rd rowhttp://creativecommons.org/publicdomain/zero/1.0/
4th rowhttp://creativecommons.org/publicdomain/zero/1.0/
5th rowhttp://creativecommons.org/publicdomain/zero/1.0/
ValueCountFrequency (%)
http://creativecommons.org/publicdomain/zero/1.0 18866
100.0%
2025-01-14T11:27:30.152170image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 113196
 
12.2%
o 94330
 
10.2%
m 56598
 
6.1%
c 56598
 
6.1%
r 56598
 
6.1%
e 56598
 
6.1%
t 56598
 
6.1%
i 56598
 
6.1%
. 37732
 
4.1%
n 37732
 
4.1%
Other values (14) 301856
32.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 716908
77.6%
Other Punctuation 169794
 
18.4%
Decimal Number 37732
 
4.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 94330
13.2%
m 56598
 
7.9%
c 56598
 
7.9%
r 56598
 
7.9%
e 56598
 
7.9%
t 56598
 
7.9%
i 56598
 
7.9%
n 37732
 
5.3%
a 37732
 
5.3%
p 37732
 
5.3%
Other values (9) 169794
23.7%
Other Punctuation
ValueCountFrequency (%)
/ 113196
66.7%
. 37732
 
22.2%
: 18866
 
11.1%
Decimal Number
ValueCountFrequency (%)
1 18866
50.0%
0 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 716908
77.6%
Common 207526
 
22.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 94330
13.2%
m 56598
 
7.9%
c 56598
 
7.9%
r 56598
 
7.9%
e 56598
 
7.9%
t 56598
 
7.9%
i 56598
 
7.9%
n 37732
 
5.3%
a 37732
 
5.3%
p 37732
 
5.3%
Other values (9) 169794
23.7%
Common
ValueCountFrequency (%)
/ 113196
54.5%
. 37732
 
18.2%
1 18866
 
9.1%
: 18866
 
9.1%
0 18866
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 924434
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 113196
 
12.2%
o 94330
 
10.2%
m 56598
 
6.1%
c 56598
 
6.1%
r 56598
 
6.1%
e 56598
 
6.1%
t 56598
 
6.1%
i 56598
 
6.1%
. 37732
 
4.1%
n 37732
 
4.1%
Other values (14) 301856
32.7%
Distinct1200
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:30.321586image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length24
Median length24
Mean length24
Min length24

Characters and Unicode

Total characters452784
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique667 ?
Unique (%)3.5%

Sample

1st row2024-10-14T12:59:55.000Z
2nd row2024-10-11T19:54:42.000Z
3rd row2024-10-11T19:54:42.000Z
4th row2024-10-11T19:54:42.000Z
5th row2024-10-11T19:54:42.000Z
ValueCountFrequency (%)
2024-09-17t21:33:28.000z 3971
21.0%
2024-10-12t17:36:53.000z 3555
18.8%
2024-09-29t10:06:24.000z 1799
 
9.5%
2024-09-23t19:57:36.000z 1572
 
8.3%
2024-02-19t13:33:41.000z 826
 
4.4%
2024-04-16t21:52:31.000z 553
 
2.9%
2024-04-28t21:51:52.000z 236
 
1.3%
2024-10-22t21:33:57.000z 219
 
1.2%
2023-07-18t22:00:07.000z 158
 
0.8%
2020-12-23t21:50:47.000z 157
 
0.8%
Other values (1190) 5820
30.8%
2025-01-14T11:27:30.556521image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 104437
23.1%
2 68042
15.0%
1 42959
9.5%
- 37732
 
8.3%
: 37732
 
8.3%
3 29809
 
6.6%
4 22029
 
4.9%
T 18866
 
4.2%
. 18866
 
4.2%
Z 18866
 
4.2%
Other values (5) 53446
11.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 320722
70.8%
Other Punctuation 56598
 
12.5%
Dash Punctuation 37732
 
8.3%
Uppercase Letter 37732
 
8.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 104437
32.6%
2 68042
21.2%
1 42959
13.4%
3 29809
 
9.3%
4 22029
 
6.9%
9 13778
 
4.3%
6 11557
 
3.6%
5 11299
 
3.5%
7 11126
 
3.5%
8 5686
 
1.8%
Other Punctuation
ValueCountFrequency (%)
: 37732
66.7%
. 18866
33.3%
Uppercase Letter
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 415052
91.7%
Latin 37732
 
8.3%

Most frequent character per script

Common
ValueCountFrequency (%)
0 104437
25.2%
2 68042
16.4%
1 42959
10.4%
- 37732
 
9.1%
: 37732
 
9.1%
3 29809
 
7.2%
4 22029
 
5.3%
. 18866
 
4.5%
9 13778
 
3.3%
6 11557
 
2.8%
Other values (3) 28111
 
6.8%
Latin
ValueCountFrequency (%)
T 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 452784
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 104437
23.1%
2 68042
15.0%
1 42959
9.5%
- 37732
 
8.3%
: 37732
 
8.3%
3 29809
 
6.6%
4 22029
 
4.9%
T 18866
 
4.2%
. 18866
 
4.2%
Z 18866
 
4.2%
Other values (5) 53446
11.8%

references
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:30.770984image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length68
Median length64
Mean length64.95473338
Min length64

Characters and Unicode

Total characters1225436
Distinct characters35
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017903
2nd rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017889
3rd rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017897
4th rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017895
5th rowhttp://collections.peabody.yale.edu/search/Record/YPM-MAM-017888
ValueCountFrequency (%)
http://collections.peabody.yale.edu/search/record/ypm-mam-017903 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017835 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017891 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017900 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017899 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017902 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017890 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017901 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017896 1
 
< 0.1%
http://collections.peabody.yale.edu/search/record/ypm-mam-017898 1
 
< 0.1%
Other values (18856) 18856
99.9%
2025-01-14T11:27:30.956396image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 113196
 
9.2%
/ 94330
 
7.7%
c 75464
 
6.2%
o 75464
 
6.2%
. 61101
 
5.0%
M 56598
 
4.6%
t 56598
 
4.6%
l 56598
 
4.6%
d 56598
 
4.6%
a 56598
 
4.6%
Other values (25) 522891
42.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 754640
61.6%
Other Punctuation 174297
 
14.2%
Uppercase Letter 132062
 
10.8%
Decimal Number 126705
 
10.3%
Dash Punctuation 37732
 
3.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 113196
15.0%
c 75464
10.0%
o 75464
10.0%
t 56598
 
7.5%
l 56598
 
7.5%
d 56598
 
7.5%
a 56598
 
7.5%
r 37732
 
5.0%
y 37732
 
5.0%
h 37732
 
5.0%
Other values (6) 150928
20.0%
Decimal Number
ValueCountFrequency (%)
0 44332
35.0%
1 20061
15.8%
2 9843
 
7.8%
6 8199
 
6.5%
7 8066
 
6.4%
5 8017
 
6.3%
4 7780
 
6.1%
3 7635
 
6.0%
9 6550
 
5.2%
8 6222
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
M 56598
42.9%
P 18866
 
14.3%
A 18866
 
14.3%
Y 18866
 
14.3%
R 18866
 
14.3%
Other Punctuation
ValueCountFrequency (%)
/ 94330
54.1%
. 61101
35.1%
: 18866
 
10.8%
Dash Punctuation
ValueCountFrequency (%)
- 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 886702
72.4%
Common 338734
 
27.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 113196
12.8%
c 75464
 
8.5%
o 75464
 
8.5%
M 56598
 
6.4%
t 56598
 
6.4%
l 56598
 
6.4%
d 56598
 
6.4%
a 56598
 
6.4%
r 37732
 
4.3%
y 37732
 
4.3%
Other values (11) 264124
29.8%
Common
ValueCountFrequency (%)
/ 94330
27.8%
. 61101
18.0%
0 44332
13.1%
- 37732
 
11.1%
1 20061
 
5.9%
: 18866
 
5.6%
2 9843
 
2.9%
6 8199
 
2.4%
7 8066
 
2.4%
5 8017
 
2.4%
Other values (4) 28187
 
8.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1225436
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 113196
 
9.2%
/ 94330
 
7.7%
c 75464
 
6.2%
o 75464
 
6.2%
. 61101
 
5.0%
M 56598
 
4.6%
t 56598
 
4.6%
l 56598
 
4.6%
d 56598
 
4.6%
a 56598
 
4.6%
Other values (25) 522891
42.7%

rightsHolder
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:31.013917image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length19
Median length19
Mean length19
Min length19

Characters and Unicode

Total characters358454
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYale Peabody Museum
2nd rowYale Peabody Museum
3rd rowYale Peabody Museum
4th rowYale Peabody Museum
5th rowYale Peabody Museum
ValueCountFrequency (%)
yale 18866
33.3%
peabody 18866
33.3%
museum 18866
33.3%
2025-01-14T11:27:31.113644image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 56598
15.8%
a 37732
10.5%
37732
10.5%
u 37732
10.5%
Y 18866
 
5.3%
l 18866
 
5.3%
P 18866
 
5.3%
b 18866
 
5.3%
o 18866
 
5.3%
d 18866
 
5.3%
Other values (4) 75464
21.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 264124
73.7%
Uppercase Letter 56598
 
15.8%
Space Separator 37732
 
10.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 56598
21.4%
a 37732
14.3%
u 37732
14.3%
l 18866
 
7.1%
b 18866
 
7.1%
o 18866
 
7.1%
d 18866
 
7.1%
y 18866
 
7.1%
s 18866
 
7.1%
m 18866
 
7.1%
Uppercase Letter
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%
Space Separator
ValueCountFrequency (%)
37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 320722
89.5%
Common 37732
 
10.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 56598
17.6%
a 37732
11.8%
u 37732
11.8%
Y 18866
 
5.9%
l 18866
 
5.9%
P 18866
 
5.9%
b 18866
 
5.9%
o 18866
 
5.9%
d 18866
 
5.9%
y 18866
 
5.9%
Other values (3) 56598
17.6%
Common
ValueCountFrequency (%)
37732
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 358454
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 56598
15.8%
a 37732
10.5%
37732
10.5%
u 37732
10.5%
Y 18866
 
5.3%
l 18866
 
5.3%
P 18866
 
5.3%
b 18866
 
5.3%
o 18866
 
5.3%
d 18866
 
5.3%
Other values (4) 75464
21.1%
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:31.157507image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1
Median length1
Mean length1
Min length1

Characters and Unicode

Total characters18866
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%
2025-01-14T11:27:31.255261image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18866
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

Most occurring scripts

ValueCountFrequency (%)
Common 18866
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18866
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 18321
97.1%
1 545
 
2.9%

institutionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:31.297296image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters56598
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYPM
2nd rowYPM
3rd rowYPM
4th rowYPM
5th rowYPM
ValueCountFrequency (%)
ypm 18866
100.0%
2025-01-14T11:27:31.392794image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 56598
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 56598
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56598
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

collectionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:31.435454image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters37732
Distinct characters2
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowVZ
2nd rowVZ
3rd rowVZ
4th rowVZ
5th rowVZ
ValueCountFrequency (%)
vz 18866
100.0%
2025-01-14T11:27:31.528341image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 37732
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 37732
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37732
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
V 18866
50.0%
Z 18866
50.0%

ownerInstitutionCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:31.569829image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters56598
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowYPM
2nd rowYPM
3rd rowYPM
4th rowYPM
5th rowYPM
ValueCountFrequency (%)
ypm 18866
100.0%
2025-01-14T11:27:31.667732image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 56598
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring scripts

ValueCountFrequency (%)
Latin 56598
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 56598
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
Y 18866
33.3%
P 18866
33.3%
M 18866
33.3%

basisOfRecord
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:31.716275image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length17
Mean length17
Min length17

Characters and Unicode

Total characters320722
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPreservedSpecimen
2nd rowPreservedSpecimen
3rd rowPreservedSpecimen
4th rowPreservedSpecimen
5th rowPreservedSpecimen
ValueCountFrequency (%)
preservedspecimen 18866
100.0%
2025-01-14T11:27:31.820072image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 94330
29.4%
r 37732
 
11.8%
P 18866
 
5.9%
s 18866
 
5.9%
v 18866
 
5.9%
d 18866
 
5.9%
S 18866
 
5.9%
p 18866
 
5.9%
c 18866
 
5.9%
i 18866
 
5.9%
Other values (2) 37732
 
11.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 282990
88.2%
Uppercase Letter 37732
 
11.8%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 94330
33.3%
r 37732
 
13.3%
s 18866
 
6.7%
v 18866
 
6.7%
d 18866
 
6.7%
p 18866
 
6.7%
c 18866
 
6.7%
i 18866
 
6.7%
m 18866
 
6.7%
n 18866
 
6.7%
Uppercase Letter
ValueCountFrequency (%)
P 18866
50.0%
S 18866
50.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 320722
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 94330
29.4%
r 37732
 
11.8%
P 18866
 
5.9%
s 18866
 
5.9%
v 18866
 
5.9%
d 18866
 
5.9%
S 18866
 
5.9%
p 18866
 
5.9%
c 18866
 
5.9%
i 18866
 
5.9%
Other values (2) 37732
 
11.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 320722
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 94330
29.4%
r 37732
 
11.8%
P 18866
 
5.9%
s 18866
 
5.9%
v 18866
 
5.9%
d 18866
 
5.9%
S 18866
 
5.9%
p 18866
 
5.9%
c 18866
 
5.9%
i 18866
 
5.9%
Other values (2) 37732
 
11.8%

dataGeneralizations
Text

Constant  Missing 

Distinct1
Distinct (%)1.5%
Missing18800
Missing (%)99.7%
Memory size147.5 KiB
2025-01-14T11:27:31.867305image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length27
Median length27
Mean length27
Min length27

Characters and Unicode

Total characters1782
Distinct characters14
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCoordinate data unavailable
2nd rowCoordinate data unavailable
3rd rowCoordinate data unavailable
4th rowCoordinate data unavailable
5th rowCoordinate data unavailable
ValueCountFrequency (%)
coordinate 66
33.3%
data 66
33.3%
unavailable 66
33.3%
2025-01-14T11:27:31.969812image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 396
22.2%
o 132
 
7.4%
d 132
 
7.4%
i 132
 
7.4%
n 132
 
7.4%
t 132
 
7.4%
e 132
 
7.4%
132
 
7.4%
l 132
 
7.4%
C 66
 
3.7%
Other values (4) 264
14.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1584
88.9%
Space Separator 132
 
7.4%
Uppercase Letter 66
 
3.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 396
25.0%
o 132
 
8.3%
d 132
 
8.3%
i 132
 
8.3%
n 132
 
8.3%
t 132
 
8.3%
e 132
 
8.3%
l 132
 
8.3%
r 66
 
4.2%
u 66
 
4.2%
Other values (2) 132
 
8.3%
Space Separator
ValueCountFrequency (%)
132
100.0%
Uppercase Letter
ValueCountFrequency (%)
C 66
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1650
92.6%
Common 132
 
7.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 396
24.0%
o 132
 
8.0%
d 132
 
8.0%
i 132
 
8.0%
n 132
 
8.0%
t 132
 
8.0%
e 132
 
8.0%
l 132
 
8.0%
C 66
 
4.0%
r 66
 
4.0%
Other values (3) 198
12.0%
Common
ValueCountFrequency (%)
132
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1782
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 396
22.2%
o 132
 
7.4%
d 132
 
7.4%
i 132
 
7.4%
n 132
 
7.4%
t 132
 
7.4%
e 132
 
7.4%
132
 
7.4%
l 132
 
7.4%
C 66
 
3.7%
Other values (4) 264
14.8%

dynamicProperties
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:32.112643image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length1073
Median length877
Mean length64.79444503
Min length19

Characters and Unicode

Total characters1222412
Distinct characters66
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st row{ "irn": "2495311" }
2nd row{ "irn": "2489043", "media": "1223142:2398869c-63eb-410d-8cf8-205d5aacbfcd", "mm_repository_id": "1223142" }
3rd row{ "irn": "2489051", "media": "1223150:ed40315a-fb57-4421-a251-a7ede5b38478", "mm_repository_id": "1223150" }
4th row{ "irn": "2489049", "media": "1223148:3d1eee9f-f1e6-4948-b842-640fbf489e2a", "mm_repository_id": "1223148" }
5th row{ "irn": "2489042", "media": "1223141:56aefa44-5e83-4aec-83f3-b632bc2756cf", "mm_repository_id": "1223141" }
ValueCountFrequency (%)
38111
29.9%
irn 18866
14.8%
solr_long_lat 13323
 
10.5%
original_num 6214
 
4.9%
osteo 4381
 
3.4%
mm_repository_id 455
 
0.4%
media 455
 
0.4%
related_record_links 379
 
0.3%
related_record_types 379
 
0.3%
71.273830,44.049466 311
 
0.2%
Other values (33627) 44501
34.9%
2025-01-14T11:27:32.339491image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 160284
 
13.1%
108509
 
8.9%
1 48769
 
4.0%
l 47322
 
3.9%
n 44996
 
3.7%
0 44312
 
3.6%
4 43624
 
3.6%
r 41589
 
3.4%
: 41225
 
3.4%
3 41094
 
3.4%
Other values (56) 600688
49.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 390133
31.9%
Lowercase Letter 325132
26.6%
Other Punctuation 272361
22.3%
Space Separator 108509
 
8.9%
Connector Punctuation 35286
 
2.9%
Uppercase Letter 26260
 
2.1%
Open Punctuation 23249
 
1.9%
Close Punctuation 23247
 
1.9%
Dash Punctuation 18214
 
1.5%
Math Symbol 19
 
< 0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
M 10867
41.4%
O 8766
33.4%
A 5225
19.9%
P 820
 
3.1%
Y 404
 
1.5%
R 105
 
0.4%
H 10
 
< 0.1%
S 8
 
< 0.1%
C 8
 
< 0.1%
E 7
 
< 0.1%
Other values (11) 40
 
0.2%
Lowercase Letter
ValueCountFrequency (%)
l 47322
14.6%
n 44996
13.8%
r 41589
12.8%
o 38912
12.0%
i 33038
10.2%
a 23318
7.2%
g 19538
6.0%
t 19299
5.9%
s 18921
 
5.8%
e 10092
 
3.1%
Other values (10) 28107
8.6%
Decimal Number
ValueCountFrequency (%)
1 48769
12.5%
0 44312
11.4%
4 43624
11.2%
3 41094
10.5%
5 40074
10.3%
7 40008
10.3%
6 38040
9.8%
2 36853
9.4%
9 29256
7.5%
8 28103
7.2%
Other Punctuation
ValueCountFrequency (%)
" 160284
58.8%
: 41225
 
15.1%
. 36322
 
13.3%
, 34528
 
12.7%
/ 1
 
< 0.1%
? 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
{ 18866
81.1%
( 4383
 
18.9%
Close Punctuation
ValueCountFrequency (%)
} 18866
81.2%
) 4381
 
18.8%
Space Separator
ValueCountFrequency (%)
108509
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 35286
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18214
100.0%
Math Symbol
ValueCountFrequency (%)
| 19
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 871020
71.3%
Latin 351392
28.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 47322
13.5%
n 44996
12.8%
r 41589
11.8%
o 38912
11.1%
i 33038
9.4%
a 23318
6.6%
g 19538
 
5.6%
t 19299
 
5.5%
s 18921
 
5.4%
M 10867
 
3.1%
Other values (31) 53592
15.3%
Common
ValueCountFrequency (%)
" 160284
18.4%
108509
12.5%
1 48769
 
5.6%
0 44312
 
5.1%
4 43624
 
5.0%
: 41225
 
4.7%
3 41094
 
4.7%
5 40074
 
4.6%
7 40008
 
4.6%
6 38040
 
4.4%
Other values (15) 265081
30.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1222412
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 160284
 
13.1%
108509
 
8.9%
1 48769
 
4.0%
l 47322
 
3.9%
n 44996
 
3.7%
0 44312
 
3.6%
4 43624
 
3.6%
r 41589
 
3.4%
: 41225
 
3.4%
3 41094
 
3.4%
Other values (56) 600688
49.1%

occurrenceID
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:32.457224image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length45
Median length45
Mean length45
Min length45

Characters and Unicode

Total characters848970
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowurn:uuid:ef710e32-eb63-4875-b9d8-f21a261c1f52
2nd rowurn:uuid:2df9a10d-0595-4c2d-bb13-43b6677a15ce
3rd rowurn:uuid:35474ea7-f956-4872-88c2-a8c56cbe9f90
4th rowurn:uuid:6eaa6b8b-f8a1-44ee-b671-1a734de9ada2
5th rowurn:uuid:b45e450f-3835-46af-be66-6494f44d014e
ValueCountFrequency (%)
urn:uuid:ef710e32-eb63-4875-b9d8-f21a261c1f52 1
 
< 0.1%
urn:uuid:7a7bd1dd-0c61-423e-8d79-316ae9466af3 1
 
< 0.1%
urn:uuid:c2221631-94d5-4364-b7a1-6e8875d768ba 1
 
< 0.1%
urn:uuid:565e73ca-2d43-4f72-bf13-66ca168617ad 1
 
< 0.1%
urn:uuid:8ebc41fa-c154-4c27-a7d4-606e62b2dc95 1
 
< 0.1%
urn:uuid:9ba9abd0-a03f-49c3-97e5-d8a6557c42bd 1
 
< 0.1%
urn:uuid:183dfe30-8155-4c5d-ae5d-15cc0b7ea3b8 1
 
< 0.1%
urn:uuid:fa9cc82d-fccf-4fb9-834c-a5e890e5ff61 1
 
< 0.1%
urn:uuid:b4b795a6-619a-4d62-b9a2-3c911f103ed3 1
 
< 0.1%
urn:uuid:a57c6e6c-6f11-465a-a440-a5953d2cf9d2 1
 
< 0.1%
Other values (18856) 18856
99.9%
2025-01-14T11:27:32.628580image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 75464
 
8.9%
u 56598
 
6.7%
4 54374
 
6.4%
d 54159
 
6.4%
8 40303
 
4.7%
9 40140
 
4.7%
b 40080
 
4.7%
a 39808
 
4.7%
: 37732
 
4.4%
f 35654
 
4.2%
Other values (12) 374658
44.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 382516
45.1%
Lowercase Letter 353258
41.6%
Dash Punctuation 75464
 
8.9%
Other Punctuation 37732
 
4.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 56598
16.0%
d 54159
15.3%
b 40080
11.3%
a 39808
11.3%
f 35654
10.1%
e 35275
10.0%
c 35086
9.9%
r 18866
 
5.3%
i 18866
 
5.3%
n 18866
 
5.3%
Decimal Number
ValueCountFrequency (%)
4 54374
14.2%
8 40303
10.5%
9 40140
10.5%
1 35621
9.3%
5 35502
9.3%
2 35421
9.3%
7 35401
9.3%
0 35374
9.2%
6 35239
9.2%
3 35141
9.2%
Dash Punctuation
ValueCountFrequency (%)
- 75464
100.0%
Other Punctuation
ValueCountFrequency (%)
: 37732
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 495712
58.4%
Latin 353258
41.6%

Most frequent character per script

Common
ValueCountFrequency (%)
- 75464
15.2%
4 54374
11.0%
8 40303
8.1%
9 40140
8.1%
: 37732
7.6%
1 35621
7.2%
5 35502
7.2%
2 35421
7.1%
7 35401
7.1%
0 35374
7.1%
Other values (2) 70380
14.2%
Latin
ValueCountFrequency (%)
u 56598
16.0%
d 54159
15.3%
b 40080
11.3%
a 39808
11.3%
f 35654
10.1%
e 35275
10.0%
c 35086
9.9%
r 18866
 
5.3%
i 18866
 
5.3%
n 18866
 
5.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 848970
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 75464
 
8.9%
u 56598
 
6.7%
4 54374
 
6.4%
d 54159
 
6.4%
8 40303
 
4.7%
9 40140
 
4.7%
b 40080
 
4.7%
a 39808
 
4.7%
: 37732
 
4.4%
f 35654
 
4.2%
Other values (12) 374658
44.1%

catalogNumber
Text

Unique 

Distinct18866
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:32.844155image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length18
Median length14
Mean length14.95473338
Min length14

Characters and Unicode

Total characters282136
Distinct characters16
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18866 ?
Unique (%)100.0%

Sample

1st rowYPM MAM 017903
2nd rowYPM MAM 017889
3rd rowYPM MAM 017897
4th rowYPM MAM 017895
5th rowYPM MAM 017888
ValueCountFrequency (%)
ypm 18866
33.3%
mam 18866
33.3%
015555.002 1
 
< 0.1%
017813 1
 
< 0.1%
017899 1
 
< 0.1%
017902 1
 
< 0.1%
017890 1
 
< 0.1%
017901 1
 
< 0.1%
017896 1
 
< 0.1%
017898 1
 
< 0.1%
Other values (18858) 18858
33.3%
2025-01-14T11:27:33.127222image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
M 56598
20.1%
0 44332
15.7%
37732
13.4%
1 20061
 
7.1%
Y 18866
 
6.7%
P 18866
 
6.7%
A 18866
 
6.7%
2 9843
 
3.5%
6 8199
 
2.9%
7 8066
 
2.9%
Other values (6) 40707
14.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 126705
44.9%
Uppercase Letter 113196
40.1%
Space Separator 37732
 
13.4%
Other Punctuation 4503
 
1.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 44332
35.0%
1 20061
15.8%
2 9843
 
7.8%
6 8199
 
6.5%
7 8066
 
6.4%
5 8017
 
6.3%
4 7780
 
6.1%
3 7635
 
6.0%
9 6550
 
5.2%
8 6222
 
4.9%
Uppercase Letter
ValueCountFrequency (%)
M 56598
50.0%
Y 18866
 
16.7%
P 18866
 
16.7%
A 18866
 
16.7%
Space Separator
ValueCountFrequency (%)
37732
100.0%
Other Punctuation
ValueCountFrequency (%)
. 4503
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 168940
59.9%
Latin 113196
40.1%

Most frequent character per script

Common
ValueCountFrequency (%)
0 44332
26.2%
37732
22.3%
1 20061
11.9%
2 9843
 
5.8%
6 8199
 
4.9%
7 8066
 
4.8%
5 8017
 
4.7%
4 7780
 
4.6%
3 7635
 
4.5%
9 6550
 
3.9%
Other values (2) 10725
 
6.3%
Latin
ValueCountFrequency (%)
M 56598
50.0%
Y 18866
 
16.7%
P 18866
 
16.7%
A 18866
 
16.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 282136
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
M 56598
20.1%
0 44332
15.7%
37732
13.4%
1 20061
 
7.1%
Y 18866
 
6.7%
P 18866
 
6.7%
A 18866
 
6.7%
2 9843
 
3.5%
6 8199
 
2.9%
7 8066
 
2.9%
Other values (6) 40707
14.4%

recordedBy
Text

Missing 

Distinct1050
Distinct (%)7.2%
Missing4296
Missing (%)22.8%
Memory size147.5 KiB
2025-01-14T11:27:33.323780image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length120
Median length80
Mean length16.20549073
Min length3

Characters and Unicode

Total characters236114
Distinct characters69
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique526 ?
Unique (%)3.6%

Sample

1st rowRichard E. Boardman, Kristof Zyskowski
2nd rowRichard E. Boardman
3rd rowLourdes M. Rojas
4th rowRichard E. Boardman
5th rowRichard E. Boardman
ValueCountFrequency (%)
mariko 1875
 
4.7%
yamasaki 1875
 
4.7%
e 1394
 
3.5%
b 1115
 
2.8%
c 1091
 
2.7%
j 1070
 
2.7%
a 867
 
2.2%
ryan 849
 
2.1%
stephens 848
 
2.1%
d 830
 
2.1%
Other values (1289) 28092
70.4%
2025-01-14T11:27:33.598336image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
25336
 
10.7%
a 21256
 
9.0%
e 16815
 
7.1%
r 13987
 
5.9%
i 13506
 
5.7%
o 12209
 
5.2%
n 11603
 
4.9%
. 10574
 
4.5%
l 9856
 
4.2%
s 8143
 
3.4%
Other values (59) 92829
39.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 157721
66.8%
Uppercase Letter 40869
 
17.3%
Space Separator 25336
 
10.7%
Other Punctuation 11218
 
4.8%
Decimal Number 552
 
0.2%
Dash Punctuation 416
 
0.2%
Open Punctuation 1
 
< 0.1%
Close Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 21256
13.5%
e 16815
10.7%
r 13987
 
8.9%
i 13506
 
8.6%
o 12209
 
7.7%
n 11603
 
7.4%
l 9856
 
6.2%
s 8143
 
5.2%
t 6641
 
4.2%
m 6447
 
4.1%
Other values (17) 37258
23.6%
Uppercase Letter
ValueCountFrequency (%)
M 4521
 
11.1%
R 4515
 
11.0%
C 3379
 
8.3%
S 2982
 
7.3%
J 2726
 
6.7%
E 2666
 
6.5%
B 2557
 
6.3%
D 2059
 
5.0%
G 1997
 
4.9%
Y 1954
 
4.8%
Other values (15) 11513
28.2%
Decimal Number
ValueCountFrequency (%)
1 190
34.4%
7 70
 
12.7%
8 69
 
12.5%
9 68
 
12.3%
6 64
 
11.6%
2 43
 
7.8%
0 41
 
7.4%
3 7
 
1.3%
Other Punctuation
ValueCountFrequency (%)
. 10574
94.3%
, 565
 
5.0%
& 44
 
0.4%
' 32
 
0.3%
/ 3
 
< 0.1%
Space Separator
ValueCountFrequency (%)
25336
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 416
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 198590
84.1%
Common 37524
 
15.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 21256
 
10.7%
e 16815
 
8.5%
r 13987
 
7.0%
i 13506
 
6.8%
o 12209
 
6.1%
n 11603
 
5.8%
l 9856
 
5.0%
s 8143
 
4.1%
t 6641
 
3.3%
m 6447
 
3.2%
Other values (42) 78127
39.3%
Common
ValueCountFrequency (%)
25336
67.5%
. 10574
28.2%
, 565
 
1.5%
- 416
 
1.1%
1 190
 
0.5%
7 70
 
0.2%
8 69
 
0.2%
9 68
 
0.2%
6 64
 
0.2%
& 44
 
0.1%
Other values (7) 128
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 236040
> 99.9%
None 74
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
25336
 
10.7%
a 21256
 
9.0%
e 16815
 
7.1%
r 13987
 
5.9%
i 13506
 
5.7%
o 12209
 
5.2%
n 11603
 
4.9%
. 10574
 
4.5%
l 9856
 
4.2%
s 8143
 
3.4%
Other values (58) 92755
39.3%
None
ValueCountFrequency (%)
ü 74
100.0%
Distinct13
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:33.657636image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length1
Mean length1.000318032
Min length1

Characters and Unicode

Total characters18872
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)< 0.1%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1
ValueCountFrequency (%)
1 18844
99.9%
2 5
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
11 2
 
< 0.1%
17 1
 
< 0.1%
10 1
 
< 0.1%
4 1
 
< 0.1%
7 1
 
< 0.1%
5 1
 
< 0.1%
Other values (3) 3
 
< 0.1%
2025-01-14T11:27:33.763120image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18872
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Common 18872
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18872
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 18850
99.9%
2 6
 
< 0.1%
3 4
 
< 0.1%
6 3
 
< 0.1%
7 3
 
< 0.1%
5 2
 
< 0.1%
0 1
 
< 0.1%
4 1
 
< 0.1%
8 1
 
< 0.1%
9 1
 
< 0.1%

sex
Text

Missing 

Distinct11
Distinct (%)0.1%
Missing10118
Missing (%)53.6%
Memory size147.5 KiB
2025-01-14T11:27:33.806237image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length15
Median length4
Mean length5.222336534
Min length4

Characters and Unicode

Total characters45685
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)< 0.1%

Sample

1st rowfemale
2nd rowfemale
3rd rowmale
4th rowmale
5th rowfemale
ValueCountFrequency (%)
male 5012
54.4%
female 4182
45.4%
unknown 10
 
0.1%
undeterminable 1
 
< 0.1%
2025-01-14T11:27:33.906292image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 13379
29.3%
m 9195
20.1%
l 9195
20.1%
a 9195
20.1%
f 4182
 
9.2%
457
 
1.0%
n 32
 
0.1%
u 11
 
< 0.1%
w 10
 
< 0.1%
k 10
 
< 0.1%
Other values (7) 19
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 45224
99.0%
Space Separator 457
 
1.0%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 13379
29.6%
m 9195
20.3%
l 9195
20.3%
a 9195
20.3%
f 4182
 
9.2%
n 32
 
0.1%
u 11
 
< 0.1%
w 10
 
< 0.1%
k 10
 
< 0.1%
o 10
 
< 0.1%
Other values (5) 5
 
< 0.1%
Space Separator
ValueCountFrequency (%)
457
100.0%
Other Punctuation
ValueCountFrequency (%)
? 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 45224
99.0%
Common 461
 
1.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 13379
29.6%
m 9195
20.3%
l 9195
20.3%
a 9195
20.3%
f 4182
 
9.2%
n 32
 
0.1%
u 11
 
< 0.1%
w 10
 
< 0.1%
k 10
 
< 0.1%
o 10
 
< 0.1%
Other values (5) 5
 
< 0.1%
Common
ValueCountFrequency (%)
457
99.1%
? 4
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 45685
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 13379
29.3%
m 9195
20.1%
l 9195
20.1%
a 9195
20.1%
f 4182
 
9.2%
457
 
1.0%
n 32
 
0.1%
u 11
 
< 0.1%
w 10
 
< 0.1%
k 10
 
< 0.1%
Other values (7) 19
 
< 0.1%

lifeStage
Text

Missing 

Distinct16
Distinct (%)1.7%
Missing17900
Missing (%)94.9%
Memory size147.5 KiB
2025-01-14T11:27:33.954534image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length5
Mean length6.195652174
Min length5

Characters and Unicode

Total characters5985
Distinct characters19
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st rowadult
2nd rowadult
3rd rowadult
4th rowadult
5th rowadult
ValueCountFrequency (%)
adult 583
57.0%
juvenile 203
 
19.9%
young 149
 
14.6%
immature 29
 
2.8%
neonate 26
 
2.5%
subadult 23
 
2.3%
fetal 7
 
0.7%
embryo 2
 
0.2%
2025-01-14T11:27:34.062461image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
u 1010
16.9%
l 816
13.6%
a 668
11.2%
t 668
11.2%
d 606
10.1%
e 496
8.3%
n 404
 
6.8%
i 232
 
3.9%
v 203
 
3.4%
j 203
 
3.4%
Other values (9) 679
11.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 5929
99.1%
Space Separator 56
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 1010
17.0%
l 816
13.8%
a 668
11.3%
t 668
11.3%
d 606
10.2%
e 496
8.4%
n 404
 
6.8%
i 232
 
3.9%
v 203
 
3.4%
j 203
 
3.4%
Other values (8) 623
10.5%
Space Separator
ValueCountFrequency (%)
56
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5929
99.1%
Common 56
 
0.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
u 1010
17.0%
l 816
13.8%
a 668
11.3%
t 668
11.3%
d 606
10.2%
e 496
8.4%
n 404
 
6.8%
i 232
 
3.9%
v 203
 
3.4%
j 203
 
3.4%
Other values (8) 623
10.5%
Common
ValueCountFrequency (%)
56
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5985
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
u 1010
16.9%
l 816
13.6%
a 668
11.2%
t 668
11.2%
d 606
10.1%
e 496
8.3%
n 404
 
6.8%
i 232
 
3.9%
v 203
 
3.4%
j 203
 
3.4%
Other values (9) 679
11.3%

reproductiveCondition
Text

Missing 

Distinct626
Distinct (%)27.3%
Missing16576
Missing (%)87.9%
Memory size147.5 KiB
2025-01-14T11:27:34.212233image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length166
Median length116
Mean length12.40349345
Min length2

Characters and Unicode

Total characters28404
Distinct characters70
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique457 ?
Unique (%)20.0%

Sample

1st rowtestes 5 x 2 mm
2nd rowEMB; 6; 10x8
3rd rowSCR; L=6x4
4th rowSCR R=8x5
5th rowEMB; L=4; R=2, 14X18
ValueCountFrequency (%)
testes 1006
16.2%
mm 877
 
14.1%
embryo 650
 
10.4%
no 643
 
10.3%
3 151
 
2.4%
2 137
 
2.2%
embryos 137
 
2.2%
lactating 137
 
2.2%
4 135
 
2.2%
5 111
 
1.8%
Other values (469) 2242
36.0%
2025-01-14T11:27:34.441696image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
3936
13.9%
e 3252
11.4%
m 2969
 
10.5%
s 2466
 
8.7%
t 2401
 
8.5%
o 1669
 
5.9%
r 1048
 
3.7%
n 966
 
3.4%
b 824
 
2.9%
y 821
 
2.9%
Other values (60) 8052
28.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 18999
66.9%
Space Separator 3936
 
13.9%
Decimal Number 2531
 
8.9%
Uppercase Letter 1676
 
5.9%
Other Punctuation 737
 
2.6%
Math Symbol 248
 
0.9%
Dash Punctuation 229
 
0.8%
Open Punctuation 24
 
0.1%
Close Punctuation 24
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 3252
17.1%
m 2969
15.6%
s 2466
13.0%
t 2401
12.6%
o 1669
8.8%
r 1048
 
5.5%
n 966
 
5.1%
b 824
 
4.3%
y 821
 
4.3%
a 601
 
3.2%
Other values (15) 1982
10.4%
Uppercase Letter
ValueCountFrequency (%)
R 354
21.1%
T 280
16.7%
L 194
11.6%
S 157
9.4%
C 141
 
8.4%
P 128
 
7.6%
N 125
 
7.5%
A 86
 
5.1%
E 70
 
4.2%
B 49
 
2.9%
Other values (8) 92
 
5.5%
Decimal Number
ValueCountFrequency (%)
5 466
18.4%
1 447
17.7%
2 372
14.7%
3 348
13.7%
4 251
9.9%
0 199
7.9%
6 162
 
6.4%
7 104
 
4.1%
8 104
 
4.1%
9 78
 
3.1%
Other Punctuation
ValueCountFrequency (%)
. 348
47.2%
, 218
29.6%
; 113
 
15.3%
: 39
 
5.3%
" 7
 
0.9%
& 5
 
0.7%
? 3
 
0.4%
/ 3
 
0.4%
' 1
 
0.1%
Math Symbol
ValueCountFrequency (%)
= 236
95.2%
+ 10
 
4.0%
~ 1
 
0.4%
> 1
 
0.4%
Space Separator
ValueCountFrequency (%)
3936
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 229
100.0%
Open Punctuation
ValueCountFrequency (%)
( 24
100.0%
Close Punctuation
ValueCountFrequency (%)
) 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 20675
72.8%
Common 7729
 
27.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 3252
15.7%
m 2969
14.4%
s 2466
11.9%
t 2401
11.6%
o 1669
8.1%
r 1048
 
5.1%
n 966
 
4.7%
b 824
 
4.0%
y 821
 
4.0%
a 601
 
2.9%
Other values (33) 3658
17.7%
Common
ValueCountFrequency (%)
3936
50.9%
5 466
 
6.0%
1 447
 
5.8%
2 372
 
4.8%
3 348
 
4.5%
. 348
 
4.5%
4 251
 
3.2%
= 236
 
3.1%
- 229
 
3.0%
, 218
 
2.8%
Other values (17) 878
 
11.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 28404
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3936
13.9%
e 3252
11.4%
m 2969
 
10.5%
s 2466
 
8.7%
t 2401
 
8.5%
o 1669
 
5.9%
r 1048
 
3.7%
n 966
 
3.4%
b 824
 
2.9%
y 821
 
2.9%
Other values (60) 8052
28.3%

behavior
Text

Missing 

Distinct2
Distinct (%)100.0%
Missing18864
Missing (%)> 99.9%
Memory size147.5 KiB
2025-01-14T11:27:34.513592image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length64
Median length56.5
Mean length56.5
Min length49

Characters and Unicode

Total characters113
Distinct characters27
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)100.0%

Sample

1st rowwas calling while hanging from a 0.5 m tall shrub
2nd rowwas day-roosting in a dense subcanopy tree ca. 15 m above ground
ValueCountFrequency (%)
was 2
 
9.1%
a 2
 
9.1%
m 2
 
9.1%
in 1
 
4.5%
above 1
 
4.5%
15 1
 
4.5%
ca 1
 
4.5%
tree 1
 
4.5%
subcanopy 1
 
4.5%
dense 1
 
4.5%
Other values (9) 9
40.9%
2025-01-14T11:27:34.643536image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
20
17.7%
a 11
 
9.7%
n 8
 
7.1%
o 6
 
5.3%
s 6
 
5.3%
e 6
 
5.3%
l 5
 
4.4%
i 5
 
4.4%
g 5
 
4.4%
r 5
 
4.4%
Other values (17) 36
31.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 86
76.1%
Space Separator 20
 
17.7%
Decimal Number 4
 
3.5%
Other Punctuation 2
 
1.8%
Dash Punctuation 1
 
0.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 11
12.8%
n 8
 
9.3%
o 6
 
7.0%
s 6
 
7.0%
e 6
 
7.0%
l 5
 
5.8%
i 5
 
5.8%
g 5
 
5.8%
r 5
 
5.8%
t 3
 
3.5%
Other values (11) 26
30.2%
Decimal Number
ValueCountFrequency (%)
5 2
50.0%
0 1
25.0%
1 1
25.0%
Space Separator
ValueCountFrequency (%)
20
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 86
76.1%
Common 27
 
23.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 11
12.8%
n 8
 
9.3%
o 6
 
7.0%
s 6
 
7.0%
e 6
 
7.0%
l 5
 
5.8%
i 5
 
5.8%
g 5
 
5.8%
r 5
 
5.8%
t 3
 
3.5%
Other values (11) 26
30.2%
Common
ValueCountFrequency (%)
20
74.1%
. 2
 
7.4%
5 2
 
7.4%
0 1
 
3.7%
- 1
 
3.7%
1 1
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 113
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
20
17.7%
a 11
 
9.7%
n 8
 
7.1%
o 6
 
5.3%
s 6
 
5.3%
e 6
 
5.3%
l 5
 
4.4%
i 5
 
4.4%
g 5
 
4.4%
r 5
 
4.4%
Other values (17) 36
31.9%

preparations
Text

Missing 

Distinct1019
Distinct (%)5.5%
Missing349
Missing (%)1.8%
Memory size147.5 KiB
2025-01-14T11:27:34.822143image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length262
Median length190
Mean length25.19781822
Min length4

Characters and Unicode

Total characters466588
Distinct characters80
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique762 ?
Unique (%)4.1%

Sample

1st rowskin, round; skull; tissue (frozen)
2nd rowtissue (frozen)
3rd rowtissue (frozen)
4th rowtissue (frozen)
5th rowtissue (frozen)
ValueCountFrequency (%)
skeleton 13111
20.4%
skull 8315
12.9%
only 7454
11.6%
skin 6927
10.8%
round 5887
9.2%
tissue 4575
 
7.1%
frozen 4435
 
6.9%
incomplete 1443
 
2.2%
alc 1212
 
1.9%
10 1172
 
1.8%
Other values (1014) 9705
15.1%
2025-01-14T11:27:35.079536image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
45719
 
9.8%
n 43643
 
9.4%
e 42409
 
9.1%
l 42371
 
9.1%
s 40001
 
8.6%
o 35814
 
7.7%
k 28618
 
6.1%
t 22389
 
4.8%
u 19801
 
4.2%
i 16215
 
3.5%
Other values (70) 129608
27.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 352064
75.5%
Space Separator 45719
 
9.8%
Other Punctuation 21809
 
4.7%
Close Punctuation 15985
 
3.4%
Open Punctuation 15984
 
3.4%
Decimal Number 7890
 
1.7%
Uppercase Letter 4747
 
1.0%
Dash Punctuation 1217
 
0.3%
Math Symbol 1173
 
0.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 43643
12.4%
e 42409
12.0%
l 42371
12.0%
s 40001
11.4%
o 35814
10.2%
k 28618
8.1%
t 22389
6.4%
u 19801
5.6%
i 16215
 
4.6%
r 13228
 
3.8%
Other values (16) 47575
13.5%
Uppercase Letter
ValueCountFrequency (%)
C 1139
24.0%
S 633
13.3%
L 485
10.2%
O 365
 
7.7%
M 258
 
5.4%
R 249
 
5.2%
I 230
 
4.8%
T 229
 
4.8%
E 213
 
4.5%
D 125
 
2.6%
Other values (16) 821
17.3%
Other Punctuation
ValueCountFrequency (%)
; 7821
35.9%
, 7453
34.2%
. 3527
16.2%
% 2384
 
10.9%
/ 316
 
1.4%
" 272
 
1.2%
& 18
 
0.1%
' 10
 
< 0.1%
? 6
 
< 0.1%
: 2
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
0 2841
36.0%
1 1930
24.5%
7 1409
17.9%
3 690
 
8.7%
2 329
 
4.2%
5 229
 
2.9%
4 186
 
2.4%
6 112
 
1.4%
8 99
 
1.3%
9 65
 
0.8%
Close Punctuation
ValueCountFrequency (%)
) 15984
> 99.9%
] 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 15983
> 99.9%
[ 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
> 1172
99.9%
+ 1
 
0.1%
Space Separator
ValueCountFrequency (%)
45719
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1217
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 356811
76.5%
Common 109777
 
23.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 43643
12.2%
e 42409
11.9%
l 42371
11.9%
s 40001
11.2%
o 35814
10.0%
k 28618
8.0%
t 22389
6.3%
u 19801
 
5.5%
i 16215
 
4.5%
r 13228
 
3.7%
Other values (42) 52322
14.7%
Common
ValueCountFrequency (%)
45719
41.6%
) 15984
 
14.6%
( 15983
 
14.6%
; 7821
 
7.1%
, 7453
 
6.8%
. 3527
 
3.2%
0 2841
 
2.6%
% 2384
 
2.2%
1 1930
 
1.8%
7 1409
 
1.3%
Other values (18) 4726
 
4.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 466588
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
45719
 
9.8%
n 43643
 
9.4%
e 42409
 
9.1%
l 42371
 
9.1%
s 40001
 
8.6%
o 35814
 
7.7%
k 28618
 
6.1%
t 22389
 
4.8%
u 19801
 
4.2%
i 16215
 
3.5%
Other values (70) 129608
27.8%
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:35.138423image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length12.98484045
Min length7

Characters and Unicode

Total characters244972
Distinct characters14
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowin collection
2nd rowin collection
3rd rowin collection
4th rowin collection
5th rowin collection
ValueCountFrequency (%)
in 18804
49.8%
collection 18804
49.8%
on 62
 
0.2%
loan 38
 
0.1%
not 14
 
< 0.1%
view 14
 
< 0.1%
exhibit 10
 
< 0.1%
2025-01-14T11:27:35.247482image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
n 37722
15.4%
o 37722
15.4%
l 37646
15.4%
i 37642
15.4%
c 37608
15.4%
18880
7.7%
e 18828
7.7%
t 18828
7.7%
a 38
 
< 0.1%
v 14
 
< 0.1%
Other values (4) 44
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 226092
92.3%
Space Separator 18880
 
7.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n 37722
16.7%
o 37722
16.7%
l 37646
16.7%
i 37642
16.6%
c 37608
16.6%
e 18828
8.3%
t 18828
8.3%
a 38
 
< 0.1%
v 14
 
< 0.1%
w 14
 
< 0.1%
Other values (3) 30
 
< 0.1%
Space Separator
ValueCountFrequency (%)
18880
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 226092
92.3%
Common 18880
 
7.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
n 37722
16.7%
o 37722
16.7%
l 37646
16.7%
i 37642
16.6%
c 37608
16.6%
e 18828
8.3%
t 18828
8.3%
a 38
 
< 0.1%
v 14
 
< 0.1%
w 14
 
< 0.1%
Other values (3) 30
 
< 0.1%
Common
ValueCountFrequency (%)
18880
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 244972
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n 37722
15.4%
o 37722
15.4%
l 37646
15.4%
i 37642
15.4%
c 37608
15.4%
18880
7.7%
e 18828
7.7%
t 18828
7.7%
a 38
 
< 0.1%
v 14
 
< 0.1%
Other values (4) 44
 
< 0.1%

associatedMedia
Text

Missing 

Distinct455
Distinct (%)100.0%
Missing18411
Missing (%)97.6%
Memory size147.5 KiB
2025-01-14T11:27:35.367718image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length113
Median length113
Mean length110.4417582
Min length107

Characters and Unicode

Total characters50251
Distinct characters35
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique455 ?
Unique (%)100.0%

Sample

1st rowhttps://images.collections.yale.edu/iiif/2/ypm:2398869c-63eb-410d-8cf8-205d5aacbfcd/full/!1920,1920/0/default.jpg
2nd rowhttps://images.collections.yale.edu/iiif/2/ypm:ed40315a-fb57-4421-a251-a7ede5b38478/full/!1920,1920/0/default.jpg
3rd rowhttps://images.collections.yale.edu/iiif/2/ypm:3d1eee9f-f1e6-4948-b842-640fbf489e2a/full/!1920,1920/0/default.jpg
4th rowhttps://images.collections.yale.edu/iiif/2/ypm:56aefa44-5e83-4aec-83f3-b632bc2756cf/full/!1920,1920/0/default.jpg
5th rowhttps://images.collections.yale.edu/iiif/2/ypm:ebedb256-ea73-46ab-ae98-27dbcdccc9d5/full/!1920,1920/0/default.jpg
ValueCountFrequency (%)
https://images.collections.yale.edu/iiif/2/ypm:4ef97955-2b97-4cd3-a7fb-0a03a196b4dd/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:de7f1b1a-a7d4-4670-bc54-9849bb3d9d8c/full/full/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:ed40315a-fb57-4421-a251-a7ede5b38478/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:3d1eee9f-f1e6-4948-b842-640fbf489e2a/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:56aefa44-5e83-4aec-83f3-b632bc2756cf/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:ebedb256-ea73-46ab-ae98-27dbcdccc9d5/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:67a707fa-ae74-4349-ad09-2e55a1a5589e/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:7d320417-0c05-49e7-9e7e-72deae2280ad/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:799d4b82-3a70-4f4e-af19-38e53ab2c90f/full/!1920,1920/0/default.jpg 1
 
0.2%
https://images.collections.yale.edu/iiif/2/ypm:7338d268-6b2a-4d3f-8caf-73ad8e7818f6/full/!1920,1920/0/default.jpg 1
 
0.2%
Other values (445) 445
97.8%
2025-01-14T11:27:35.555642image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 4095
 
8.1%
l 3118
 
6.2%
e 3113
 
6.2%
f 2407
 
4.8%
a 2326
 
4.6%
i 2275
 
4.5%
2 1846
 
3.7%
. 1820
 
3.6%
- 1820
 
3.6%
t 1820
 
3.6%
Other values (25) 25611
51.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 28809
57.3%
Decimal Number 12275
24.4%
Other Punctuation 7347
 
14.6%
Dash Punctuation 1820
 
3.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 3118
10.8%
e 3113
10.8%
f 2407
 
8.4%
a 2326
 
8.1%
i 2275
 
7.9%
t 1820
 
6.3%
c 1768
 
6.1%
d 1740
 
6.0%
u 1559
 
5.4%
s 1365
 
4.7%
Other values (9) 7318
25.4%
Decimal Number
ValueCountFrequency (%)
2 1846
15.0%
0 1811
14.8%
9 1501
12.2%
1 1394
11.4%
4 1352
11.0%
8 934
7.6%
3 897
7.3%
7 860
7.0%
6 860
7.0%
5 820
6.7%
Other Punctuation
ValueCountFrequency (%)
/ 4095
55.7%
. 1820
24.8%
: 910
 
12.4%
! 261
 
3.6%
, 261
 
3.6%
Dash Punctuation
ValueCountFrequency (%)
- 1820
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 28809
57.3%
Common 21442
42.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 3118
10.8%
e 3113
10.8%
f 2407
 
8.4%
a 2326
 
8.1%
i 2275
 
7.9%
t 1820
 
6.3%
c 1768
 
6.1%
d 1740
 
6.0%
u 1559
 
5.4%
s 1365
 
4.7%
Other values (9) 7318
25.4%
Common
ValueCountFrequency (%)
/ 4095
19.1%
2 1846
8.6%
. 1820
8.5%
- 1820
8.5%
0 1811
8.4%
9 1501
 
7.0%
1 1394
 
6.5%
4 1352
 
6.3%
8 934
 
4.4%
: 910
 
4.2%
Other values (6) 3959
18.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50251
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 4095
 
8.1%
l 3118
 
6.2%
e 3113
 
6.2%
f 2407
 
4.8%
a 2326
 
4.6%
i 2275
 
4.5%
2 1846
 
3.7%
. 1820
 
3.6%
- 1820
 
3.6%
t 1820
 
3.6%
Other values (25) 25611
51.0%

associatedReferences
Text

Missing 

Distinct178
Distinct (%)2.8%
Missing12450
Missing (%)66.0%
Memory size147.5 KiB
2025-01-14T11:27:35.681035image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length116
Median length1
Mean length8.085099751
Min length1

Characters and Unicode

Total characters51874
Distinct characters65
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique75 ?
Unique (%)1.2%

Sample

1st row|
2nd row|
3rd row|
4th row|
5th row|
ValueCountFrequency (%)
4933
37.6%
by 1565
 
11.9%
det 1461
 
11.1%
kristof 303
 
2.3%
jordan 300
 
2.3%
g 300
 
2.3%
colosi 300
 
2.3%
a 296
 
2.3%
zyskowski 291
 
2.2%
mary 288
 
2.2%
Other values (171) 3078
23.5%
2025-01-14T11:27:35.878700image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
| 7591
 
14.6%
6699
 
12.9%
e 2799
 
5.4%
. 2603
 
5.0%
r 2373
 
4.6%
y 2262
 
4.4%
t 2241
 
4.3%
o 2076
 
4.0%
b 1743
 
3.4%
D 1738
 
3.4%
Other values (55) 19749
38.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 23551
45.4%
Math Symbol 7591
 
14.6%
Space Separator 6699
 
12.9%
Uppercase Letter 5999
 
11.6%
Other Punctuation 4176
 
8.1%
Decimal Number 3818
 
7.4%
Dash Punctuation 40
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 2799
11.9%
r 2373
10.1%
y 2262
9.6%
t 2241
9.5%
o 2076
8.8%
b 1743
7.4%
s 1614
 
6.9%
i 1410
 
6.0%
a 1357
 
5.8%
n 1339
 
5.7%
Other values (15) 4337
18.4%
Uppercase Letter
ValueCountFrequency (%)
D 1738
29.0%
A 601
 
10.0%
K 503
 
8.4%
C 448
 
7.5%
J 435
 
7.3%
M 426
 
7.1%
G 353
 
5.9%
T 326
 
5.4%
Z 303
 
5.1%
N 129
 
2.2%
Other values (13) 737
12.3%
Decimal Number
ValueCountFrequency (%)
0 1658
43.4%
2 1138
29.8%
8 277
 
7.3%
9 275
 
7.2%
1 251
 
6.6%
7 132
 
3.5%
6 33
 
0.9%
4 25
 
0.7%
3 21
 
0.6%
5 8
 
0.2%
Other Punctuation
ValueCountFrequency (%)
. 2603
62.3%
: 1569
37.6%
; 3
 
0.1%
, 1
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
| 7591
100.0%
Space Separator
ValueCountFrequency (%)
6699
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 40
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 29550
57.0%
Common 22324
43.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 2799
 
9.5%
r 2373
 
8.0%
y 2262
 
7.7%
t 2241
 
7.6%
o 2076
 
7.0%
b 1743
 
5.9%
D 1738
 
5.9%
s 1614
 
5.5%
i 1410
 
4.8%
a 1357
 
4.6%
Other values (38) 9937
33.6%
Common
ValueCountFrequency (%)
| 7591
34.0%
6699
30.0%
. 2603
 
11.7%
0 1658
 
7.4%
: 1569
 
7.0%
2 1138
 
5.1%
8 277
 
1.2%
9 275
 
1.2%
1 251
 
1.1%
7 132
 
0.6%
Other values (7) 131
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 51873
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
| 7591
 
14.6%
6699
 
12.9%
e 2799
 
5.4%
. 2603
 
5.0%
r 2373
 
4.6%
y 2262
 
4.4%
t 2241
 
4.3%
o 2076
 
4.0%
b 1743
 
3.4%
D 1738
 
3.4%
Other values (54) 19748
38.1%
None
ValueCountFrequency (%)
é 1
100.0%

associatedTaxa
Text

Missing 

Distinct373
Distinct (%)98.4%
Missing18487
Missing (%)98.0%
Memory size147.5 KiB
2025-01-14T11:27:36.073482image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length131
Median length10
Mean length13.77572559
Min length10

Characters and Unicode

Total characters5221
Distinct characters44
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique371 ?
Unique (%)97.9%

Sample

1st rowENT.013766
2nd rowoffspring: MAM.015755
3rd rowparent: MAM.015754
4th rowMAM.001438
5th rowMAM.004953
ValueCountFrequency (%)
part 39
 
6.9%
same 36
 
6.3%
specimen 36
 
6.3%
of 36
 
6.3%
other 8
 
1.4%
parent 7
 
1.2%
mam.012670 6
 
1.1%
skeleton 3
 
0.5%
mam.013246|part 3
 
0.5%
mam.013247|part 3
 
0.5%
Other values (381) 392
68.9%
2025-01-14T11:27:36.456872image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 884
16.9%
M 775
14.8%
. 402
 
7.7%
A 391
 
7.5%
1 344
 
6.6%
3 195
 
3.7%
190
 
3.6%
9 173
 
3.3%
2 166
 
3.2%
5 148
 
2.8%
Other values (34) 1553
29.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2394
45.9%
Uppercase Letter 1206
23.1%
Lowercase Letter 902
 
17.3%
Other Punctuation 510
 
9.8%
Space Separator 190
 
3.6%
Math Symbol 19
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 130
14.4%
p 100
11.1%
a 98
10.9%
s 85
9.4%
r 72
8.0%
m 72
8.0%
t 70
7.8%
n 59
6.5%
o 54
6.0%
f 50
 
5.5%
Other values (7) 112
12.4%
Uppercase Letter
ValueCountFrequency (%)
M 775
64.3%
A 391
32.4%
H 7
 
0.6%
E 6
 
0.5%
X 5
 
0.4%
Y 5
 
0.4%
P 5
 
0.4%
R 5
 
0.4%
T 3
 
0.2%
S 2
 
0.2%
Other values (2) 2
 
0.2%
Decimal Number
ValueCountFrequency (%)
0 884
36.9%
1 344
 
14.4%
3 195
 
8.1%
9 173
 
7.2%
2 166
 
6.9%
5 148
 
6.2%
4 133
 
5.6%
6 124
 
5.2%
8 114
 
4.8%
7 113
 
4.7%
Other Punctuation
ValueCountFrequency (%)
. 402
78.8%
: 75
 
14.7%
? 33
 
6.5%
Space Separator
ValueCountFrequency (%)
190
100.0%
Math Symbol
ValueCountFrequency (%)
| 19
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3113
59.6%
Latin 2108
40.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 775
36.8%
A 391
18.5%
e 130
 
6.2%
p 100
 
4.7%
a 98
 
4.6%
s 85
 
4.0%
r 72
 
3.4%
m 72
 
3.4%
t 70
 
3.3%
n 59
 
2.8%
Other values (19) 256
 
12.1%
Common
ValueCountFrequency (%)
0 884
28.4%
. 402
12.9%
1 344
 
11.1%
3 195
 
6.3%
190
 
6.1%
9 173
 
5.6%
2 166
 
5.3%
5 148
 
4.8%
4 133
 
4.3%
6 124
 
4.0%
Other values (5) 354
11.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5221
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 884
16.9%
M 775
14.8%
. 402
 
7.7%
A 391
 
7.5%
1 344
 
6.6%
3 195
 
3.7%
190
 
3.6%
9 173
 
3.3%
2 166
 
3.2%
5 148
 
2.8%
Other values (34) 1553
29.7%

otherCatalogNumbers
Text

Missing 

Distinct6197
Distinct (%)99.7%
Missing12652
Missing (%)67.1%
Memory size147.5 KiB
2025-01-14T11:27:36.657574image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length232
Median length128
Mean length20.18828452
Min length3

Characters and Unicode

Total characters125450
Distinct characters55
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique6180 ?
Unique (%)99.5%

Sample

1st rowOsteo 12753 (MAM.O.12753)
2nd rowOsteo 2583 (MAM.O.02583)
3rd rowOsteo 3875 (MAM.O.03875)
4th rowVP.061504
5th rowUAM 112553
ValueCountFrequency (%)
osteo 4594
28.9%
11593 14
 
0.1%
m 6
 
< 0.1%
dcm 5
 
< 0.1%
uam 5
 
< 0.1%
s 5
 
< 0.1%
3978 4
 
< 0.1%
295 3
 
< 0.1%
10180 3
 
< 0.1%
14460 3
 
< 0.1%
Other values (10931) 11236
70.8%
2025-01-14T11:27:36.919988image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
. 10298
 
8.2%
M 10211
 
8.1%
9664
 
7.7%
O 9191
 
7.3%
1 8952
 
7.1%
0 8355
 
6.7%
3 6022
 
4.8%
4 5451
 
4.3%
2 5128
 
4.1%
A 5092
 
4.1%
Other values (45) 47086
37.5%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 51869
41.3%
Uppercase Letter 25108
20.0%
Lowercase Letter 18707
 
14.9%
Other Punctuation 10738
 
8.6%
Space Separator 9664
 
7.7%
Open Punctuation 4595
 
3.7%
Close Punctuation 4593
 
3.7%
Dash Punctuation 176
 
0.1%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
M 10211
40.7%
O 9191
36.6%
A 5092
20.3%
P 463
 
1.8%
R 100
 
0.4%
C 9
 
< 0.1%
D 6
 
< 0.1%
S 6
 
< 0.1%
Z 6
 
< 0.1%
U 5
 
< 0.1%
Other values (10) 19
 
0.1%
Lowercase Letter
ValueCountFrequency (%)
e 4601
24.6%
s 4598
24.6%
o 4597
24.6%
t 4597
24.6%
m 147
 
0.8%
a 83
 
0.4%
p 72
 
0.4%
l 2
 
< 0.1%
r 2
 
< 0.1%
c 2
 
< 0.1%
Other values (6) 6
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 8952
17.3%
0 8355
16.1%
3 6022
11.6%
4 5451
10.5%
2 5128
9.9%
5 4090
7.9%
9 3811
7.3%
6 3503
 
6.8%
7 3393
 
6.5%
8 3164
 
6.1%
Other Punctuation
ValueCountFrequency (%)
. 10298
95.9%
; 436
 
4.1%
" 2
 
< 0.1%
? 1
 
< 0.1%
/ 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
9664
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4595
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4593
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 176
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 81635
65.1%
Latin 43815
34.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
M 10211
23.3%
O 9191
21.0%
A 5092
11.6%
e 4601
10.5%
s 4598
10.5%
o 4597
10.5%
t 4597
10.5%
P 463
 
1.1%
m 147
 
0.3%
R 100
 
0.2%
Other values (26) 218
 
0.5%
Common
ValueCountFrequency (%)
. 10298
12.6%
9664
11.8%
1 8952
11.0%
0 8355
10.2%
3 6022
7.4%
4 5451
 
6.7%
2 5128
 
6.3%
( 4595
 
5.6%
) 4593
 
5.6%
5 4090
 
5.0%
Other values (9) 14487
17.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 125450
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
. 10298
 
8.2%
M 10211
 
8.1%
9664
 
7.7%
O 9191
 
7.3%
1 8952
 
7.1%
0 8355
 
6.7%
3 6022
 
4.8%
4 5451
 
4.3%
2 5128
 
4.1%
A 5092
 
4.1%
Other values (45) 47086
37.5%
Distinct18842
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:37.122359image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length654
Median length580
Mean length69.48706668
Min length13

Characters and Unicode

Total characters1310943
Distinct characters84
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique18818 ?
Unique (%)99.7%

Sample

1st rowMAM number 17903; female; personal specimen number MFH 162; testes 5 x 2 mm
2nd rowMAM number 17889; female
3rd rowMAM number 17897; male
4th rowMAM number 17895; male
5th rowMAM number 17888; female
ValueCountFrequency (%)
number 29739
 
16.2%
mam 18873
 
10.3%
original 6652
 
3.6%
catalog 6652
 
3.6%
male 5021
 
2.7%
osteo 4618
 
2.5%
specimen 4419
 
2.4%
personal 4201
 
2.3%
female 4185
 
2.3%
accn=ypm.12236 2399
 
1.3%
Other values (25895) 96369
52.6%
2025-01-14T11:27:37.395157image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
164262
 
12.5%
e 84610
 
6.5%
n 72932
 
5.6%
a 67392
 
5.1%
M 58097
 
4.4%
m 52492
 
4.0%
r 52119
 
4.0%
c 44063
 
3.4%
1 42870
 
3.3%
o 40086
 
3.1%
Other values (74) 632020
48.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 689220
52.6%
Decimal Number 220996
 
16.9%
Space Separator 164262
 
12.5%
Uppercase Letter 139813
 
10.7%
Other Punctuation 71758
 
5.5%
Math Symbol 13316
 
1.0%
Open Punctuation 5176
 
0.4%
Close Punctuation 5170
 
0.4%
Dash Punctuation 1232
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 84610
12.3%
n 72932
10.6%
a 67392
9.8%
m 52492
 
7.6%
r 52119
 
7.6%
c 44063
 
6.4%
o 40086
 
5.8%
l 39877
 
5.8%
u 37915
 
5.5%
b 35664
 
5.2%
Other values (16) 162070
23.5%
Uppercase Letter
ValueCountFrequency (%)
M 58097
41.6%
A 29360
21.0%
P 10732
 
7.7%
O 9348
 
6.7%
Y 8914
 
6.4%
V 3906
 
2.8%
Z 3819
 
2.7%
R 3460
 
2.5%
S 2466
 
1.8%
B 1899
 
1.4%
Other values (16) 7812
 
5.6%
Decimal Number
ValueCountFrequency (%)
1 42870
19.4%
0 35267
16.0%
2 23803
10.8%
3 21594
9.8%
4 21563
9.8%
6 20089
9.1%
5 16601
 
7.5%
7 14226
 
6.4%
9 13052
 
5.9%
8 11931
 
5.4%
Other Punctuation
ValueCountFrequency (%)
; 39221
54.7%
. 27314
38.1%
, 4064
 
5.7%
: 788
 
1.1%
/ 142
 
0.2%
? 123
 
0.2%
" 56
 
0.1%
' 32
 
< 0.1%
& 18
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 13301
99.9%
+ 10
 
0.1%
± 3
 
< 0.1%
~ 1
 
< 0.1%
> 1
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 5170
99.9%
[ 5
 
0.1%
{ 1
 
< 0.1%
Close Punctuation
ValueCountFrequency (%)
) 5164
99.9%
] 5
 
0.1%
} 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
164262
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1232
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 829033
63.2%
Common 481910
36.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 84610
 
10.2%
n 72932
 
8.8%
a 67392
 
8.1%
M 58097
 
7.0%
m 52492
 
6.3%
r 52119
 
6.3%
c 44063
 
5.3%
o 40086
 
4.8%
l 39877
 
4.8%
u 37915
 
4.6%
Other values (42) 279450
33.7%
Common
ValueCountFrequency (%)
164262
34.1%
1 42870
 
8.9%
; 39221
 
8.1%
0 35267
 
7.3%
. 27314
 
5.7%
2 23803
 
4.9%
3 21594
 
4.5%
4 21563
 
4.5%
6 20089
 
4.2%
5 16601
 
3.4%
Other values (22) 69326
14.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1310940
> 99.9%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
164262
 
12.5%
e 84610
 
6.5%
n 72932
 
5.6%
a 67392
 
5.1%
M 58097
 
4.4%
m 52492
 
4.0%
r 52119
 
4.0%
c 44063
 
3.4%
1 42870
 
3.3%
o 40086
 
3.1%
Other values (73) 632017
48.2%
None
ValueCountFrequency (%)
± 3
100.0%
Distinct3180
Distinct (%)17.0%
Missing152
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:37.560959image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length135
Median length105
Mean length29.92679278
Min length3

Characters and Unicode

Total characters560050
Distinct characters55
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1569 ?
Unique (%)8.4%

Sample

1st rowTamias striatus fisheri
2nd rowPeromyscus leucopus noveboracensis
3rd rowPeromyscus leucopus noveboracensis
4th rowPeromyscus leucopus noveboracensis
5th rowPeromyscus leucopus noveboracensis
ValueCountFrequency (%)
peromyscus 1837
 
3.4%
gapperi 1530
 
2.8%
cinereus 1460
 
2.7%
brevicauda 1361
 
2.5%
sorex 1193
 
2.2%
blarina 976
 
1.8%
maniculatus 919
 
1.7%
zibethicus 906
 
1.7%
leucopus 836
 
1.6%
talpoides 759
 
1.4%
Other values (3630) 42002
78.1%
2025-01-14T11:27:37.802572image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 54787
 
9.8%
i 49194
 
8.8%
a 47829
 
8.5%
u 39741
 
7.1%
e 39634
 
7.1%
r 35507
 
6.3%
35065
 
6.3%
o 33086
 
5.9%
n 28079
 
5.0%
c 25884
 
4.6%
Other values (45) 171244
30.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 491072
87.7%
Space Separator 35065
 
6.3%
Uppercase Letter 26308
 
4.7%
Math Symbol 7591
 
1.4%
Other Punctuation 10
 
< 0.1%
Dash Punctuation 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 54787
11.2%
i 49194
10.0%
a 47829
9.7%
u 39741
 
8.1%
e 39634
 
8.1%
r 35507
 
7.2%
o 33086
 
6.7%
n 28079
 
5.7%
c 25884
 
5.3%
l 22447
 
4.6%
Other values (16) 114884
23.4%
Uppercase Letter
ValueCountFrequency (%)
P 3888
14.8%
C 3681
14.0%
M 3187
12.1%
S 2683
10.2%
B 1796
 
6.8%
T 1608
 
6.1%
O 1561
 
5.9%
N 1123
 
4.3%
A 925
 
3.5%
L 925
 
3.5%
Other values (14) 4931
18.7%
Other Punctuation
ValueCountFrequency (%)
. 8
80.0%
? 2
 
20.0%
Space Separator
ValueCountFrequency (%)
35065
100.0%
Math Symbol
ValueCountFrequency (%)
| 7591
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 517380
92.4%
Common 42670
 
7.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 54787
 
10.6%
i 49194
 
9.5%
a 47829
 
9.2%
u 39741
 
7.7%
e 39634
 
7.7%
r 35507
 
6.9%
o 33086
 
6.4%
n 28079
 
5.4%
c 25884
 
5.0%
l 22447
 
4.3%
Other values (40) 141192
27.3%
Common
ValueCountFrequency (%)
35065
82.2%
| 7591
 
17.8%
. 8
 
< 0.1%
- 4
 
< 0.1%
? 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 560050
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 54787
 
9.8%
i 49194
 
8.8%
a 47829
 
8.5%
u 39741
 
7.1%
e 39634
 
7.1%
r 35507
 
6.3%
35065
 
6.3%
o 33086
 
5.9%
n 28079
 
5.0%
c 25884
 
4.6%
Other values (45) 171244
30.6%

fieldNumber
Text

Missing 

Distinct5159
Distinct (%)70.6%
Missing11555
Missing (%)61.2%
Memory size147.5 KiB
2025-01-14T11:27:37.985518image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length16
Mean length4.113664341
Min length1

Characters and Unicode

Total characters30075
Distinct characters68
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4249 ?
Unique (%)58.1%

Sample

1st row14251
2nd rowP5
3rd rowP14
4th rowP12
5th rowP4
ValueCountFrequency (%)
f 452
 
5.3%
r 169
 
2.0%
l 162
 
1.9%
mcz 50
 
0.6%
2 44
 
0.5%
3 43
 
0.5%
1 42
 
0.5%
5 38
 
0.4%
jas 32
 
0.4%
4 31
 
0.4%
Other values (4656) 7419
87.5%
2025-01-14T11:27:38.215564image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 4503
15.0%
3 2724
9.1%
4 2723
9.1%
2 2604
8.7%
0 2480
8.2%
8 2138
 
7.1%
9 1836
 
6.1%
7 1835
 
6.1%
5 1834
 
6.1%
6 1742
 
5.8%
Other values (58) 5656
18.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 24419
81.2%
Uppercase Letter 3351
 
11.1%
Space Separator 1171
 
3.9%
Dash Punctuation 829
 
2.8%
Lowercase Letter 148
 
0.5%
Open Punctuation 53
 
0.2%
Close Punctuation 53
 
0.2%
Other Punctuation 51
 
0.2%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
F 853
25.5%
R 343
10.2%
Q 331
 
9.9%
A 229
 
6.8%
M 198
 
5.9%
L 178
 
5.3%
B 172
 
5.1%
Z 145
 
4.3%
C 133
 
4.0%
P 131
 
3.9%
Other values (16) 638
19.0%
Lowercase Letter
ValueCountFrequency (%)
a 24
16.2%
m 17
11.5%
l 16
10.8%
e 13
8.8%
o 12
 
8.1%
t 10
 
6.8%
r 8
 
5.4%
i 7
 
4.7%
n 6
 
4.1%
p 5
 
3.4%
Other values (10) 30
20.3%
Decimal Number
ValueCountFrequency (%)
1 4503
18.4%
3 2724
11.2%
4 2723
11.2%
2 2604
10.7%
0 2480
10.2%
8 2138
8.8%
9 1836
7.5%
7 1835
7.5%
5 1834
7.5%
6 1742
 
7.1%
Other Punctuation
ValueCountFrequency (%)
. 29
56.9%
? 9
 
17.6%
/ 7
 
13.7%
# 3
 
5.9%
; 2
 
3.9%
: 1
 
2.0%
Open Punctuation
ValueCountFrequency (%)
[ 52
98.1%
( 1
 
1.9%
Close Punctuation
ValueCountFrequency (%)
] 52
98.1%
) 1
 
1.9%
Space Separator
ValueCountFrequency (%)
1171
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 829
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 26576
88.4%
Latin 3499
 
11.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
F 853
24.4%
R 343
9.8%
Q 331
 
9.5%
A 229
 
6.5%
M 198
 
5.7%
L 178
 
5.1%
B 172
 
4.9%
Z 145
 
4.1%
C 133
 
3.8%
P 131
 
3.7%
Other values (36) 786
22.5%
Common
ValueCountFrequency (%)
1 4503
16.9%
3 2724
10.2%
4 2723
10.2%
2 2604
9.8%
0 2480
9.3%
8 2138
8.0%
9 1836
6.9%
7 1835
6.9%
5 1834
6.9%
6 1742
 
6.6%
Other values (12) 2157
8.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 30075
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 4503
15.0%
3 2724
9.1%
4 2723
9.1%
2 2604
8.7%
0 2480
8.2%
8 2138
 
7.1%
9 1836
 
6.1%
7 1835
 
6.1%
5 1834
 
6.1%
6 1742
 
5.8%
Other values (58) 5656
18.8%

eventDate
Text

Missing 

Distinct3892
Distinct (%)30.8%
Missing6221
Missing (%)33.0%
Memory size147.5 KiB
2025-01-14T11:27:38.416069image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length21
Median length10
Mean length9.587742191
Min length4

Characters and Unicode

Total characters121237
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2081 ?
Unique (%)16.5%

Sample

1st row2024-08-15
2nd row2023-12-01
3rd row2023-12-28
4th row2023-12-20
5th row2023-11-30
ValueCountFrequency (%)
2012-07-18 178
 
1.4%
2012-07-15 170
 
1.3%
1959 162
 
1.3%
1970/1973 156
 
1.2%
2012-07-16 150
 
1.2%
2012-07-24 144
 
1.1%
2013-08-02 109
 
0.9%
2020-10-07 108
 
0.9%
2020-10-14 100
 
0.8%
2020-10-08 96
 
0.8%
Other values (3882) 11272
89.1%
2025-01-14T11:27:38.681655image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 22749
18.8%
0 22201
18.3%
1 21441
17.7%
2 12804
10.6%
9 11089
9.1%
7 6180
 
5.1%
6 5838
 
4.8%
5 5285
 
4.4%
3 4816
 
4.0%
8 4771
 
3.9%
Other values (2) 4063
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 98006
80.8%
Dash Punctuation 22749
 
18.8%
Other Punctuation 482
 
0.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 22201
22.7%
1 21441
21.9%
2 12804
13.1%
9 11089
11.3%
7 6180
 
6.3%
6 5838
 
6.0%
5 5285
 
5.4%
3 4816
 
4.9%
8 4771
 
4.9%
4 3581
 
3.7%
Dash Punctuation
ValueCountFrequency (%)
- 22749
100.0%
Other Punctuation
ValueCountFrequency (%)
/ 482
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 121237
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
- 22749
18.8%
0 22201
18.3%
1 21441
17.7%
2 12804
10.6%
9 11089
9.1%
7 6180
 
5.1%
6 5838
 
4.8%
5 5285
 
4.4%
3 4816
 
4.0%
8 4771
 
3.9%
Other values (2) 4063
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 121237
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 22749
18.8%
0 22201
18.3%
1 21441
17.7%
2 12804
10.6%
9 11089
9.1%
7 6180
 
5.1%
6 5838
 
4.8%
5 5285
 
4.4%
3 4816
 
4.0%
8 4771
 
3.9%
Other values (2) 4063
 
3.4%

year
Text

Missing 

Distinct157
Distinct (%)1.2%
Missing6267
Missing (%)33.2%
Memory size147.5 KiB
2025-01-14T11:27:38.839560image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters50396
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique11 ?
Unique (%)0.1%

Sample

1st row2024
2nd row2023
3rd row2023
4th row2023
5th row2023
ValueCountFrequency (%)
2013 864
 
6.9%
2012 821
 
6.5%
2020 800
 
6.3%
2014 728
 
5.8%
1965 714
 
5.7%
1962 345
 
2.7%
1956 329
 
2.6%
1964 295
 
2.3%
1959 285
 
2.3%
1952 274
 
2.2%
Other values (147) 7144
56.7%
2025-01-14T11:27:39.047540image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 12085
24.0%
9 8638
17.1%
2 7713
15.3%
0 6850
13.6%
5 3400
 
6.7%
6 3380
 
6.7%
3 2762
 
5.5%
7 2062
 
4.1%
4 1835
 
3.6%
8 1671
 
3.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 50396
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 12085
24.0%
9 8638
17.1%
2 7713
15.3%
0 6850
13.6%
5 3400
 
6.7%
6 3380
 
6.7%
3 2762
 
5.5%
7 2062
 
4.1%
4 1835
 
3.6%
8 1671
 
3.3%

Most occurring scripts

ValueCountFrequency (%)
Common 50396
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 12085
24.0%
9 8638
17.1%
2 7713
15.3%
0 6850
13.6%
5 3400
 
6.7%
6 3380
 
6.7%
3 2762
 
5.5%
7 2062
 
4.1%
4 1835
 
3.6%
8 1671
 
3.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 50396
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 12085
24.0%
9 8638
17.1%
2 7713
15.3%
0 6850
13.6%
5 3400
 
6.7%
6 3380
 
6.7%
3 2762
 
5.5%
7 2062
 
4.1%
4 1835
 
3.6%
8 1671
 
3.3%

month
Text

Missing 

Distinct12
Distinct (%)0.1%
Missing7343
Missing (%)38.9%
Memory size147.5 KiB
2025-01-14T11:27:39.108328image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length1
Mean length1.202638202
Min length1

Characters and Unicode

Total characters13858
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row8
2nd row12
3rd row12
4th row12
5th row11
ValueCountFrequency (%)
7 2664
23.1%
8 1680
14.6%
10 1318
11.4%
6 1186
10.3%
9 830
 
7.2%
1 720
 
6.2%
11 609
 
5.3%
5 592
 
5.1%
4 512
 
4.4%
3 509
 
4.4%
Other values (2) 903
 
7.8%
2025-01-14T11:27:39.211872image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 3664
26.4%
7 2664
19.2%
8 1680
12.1%
0 1318
 
9.5%
6 1186
 
8.6%
2 903
 
6.5%
9 830
 
6.0%
5 592
 
4.3%
4 512
 
3.7%
3 509
 
3.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 13858
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 3664
26.4%
7 2664
19.2%
8 1680
12.1%
0 1318
 
9.5%
6 1186
 
8.6%
2 903
 
6.5%
9 830
 
6.0%
5 592
 
4.3%
4 512
 
3.7%
3 509
 
3.7%

Most occurring scripts

ValueCountFrequency (%)
Common 13858
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 3664
26.4%
7 2664
19.2%
8 1680
12.1%
0 1318
 
9.5%
6 1186
 
8.6%
2 903
 
6.5%
9 830
 
6.0%
5 592
 
4.3%
4 512
 
3.7%
3 509
 
3.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 13858
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 3664
26.4%
7 2664
19.2%
8 1680
12.1%
0 1318
 
9.5%
6 1186
 
8.6%
2 903
 
6.5%
9 830
 
6.0%
5 592
 
4.3%
4 512
 
3.7%
3 509
 
3.7%

day
Text

Missing 

Distinct31
Distinct (%)0.3%
Missing7899
Missing (%)41.9%
Memory size147.5 KiB
2025-01-14T11:27:39.278511image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length2
Median length2
Mean length1.677122276
Min length1

Characters and Unicode

Total characters18393
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row15
2nd row1
3rd row28
4th row20
5th row30
ValueCountFrequency (%)
18 555
 
5.1%
15 518
 
4.7%
7 470
 
4.3%
16 465
 
4.2%
8 445
 
4.1%
9 434
 
4.0%
2 429
 
3.9%
24 429
 
3.9%
19 410
 
3.7%
4 386
 
3.5%
Other values (21) 6426
58.6%
2025-01-14T11:27:39.403188image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 5094
27.7%
2 4015
21.8%
3 1363
 
7.4%
8 1242
 
6.8%
4 1181
 
6.4%
7 1155
 
6.3%
5 1152
 
6.3%
6 1112
 
6.0%
9 1087
 
5.9%
0 992
 
5.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 18393
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 5094
27.7%
2 4015
21.8%
3 1363
 
7.4%
8 1242
 
6.8%
4 1181
 
6.4%
7 1155
 
6.3%
5 1152
 
6.3%
6 1112
 
6.0%
9 1087
 
5.9%
0 992
 
5.4%

Most occurring scripts

ValueCountFrequency (%)
Common 18393
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 5094
27.7%
2 4015
21.8%
3 1363
 
7.4%
8 1242
 
6.8%
4 1181
 
6.4%
7 1155
 
6.3%
5 1152
 
6.3%
6 1112
 
6.0%
9 1087
 
5.9%
0 992
 
5.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 18393
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 5094
27.7%
2 4015
21.8%
3 1363
 
7.4%
8 1242
 
6.8%
4 1181
 
6.4%
7 1155
 
6.3%
5 1152
 
6.3%
6 1112
 
6.0%
9 1087
 
5.9%
0 992
 
5.4%

habitat
Text

Missing 

Distinct49
Distinct (%)38.6%
Missing18739
Missing (%)99.3%
Memory size147.5 KiB
2025-01-14T11:27:39.552896image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length185
Median length88
Mean length16.97637795
Min length5

Characters and Unicode

Total characters2156
Distinct characters46
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique38 ?
Unique (%)29.9%

Sample

1st rowUrban
2nd rowUrban
3rd rowUrban
4th rowUrban
5th rowUrban
ValueCountFrequency (%)
urban 50
 
14.2%
in 21
 
5.9%
suburban 18
 
5.1%
forest 10
 
2.8%
by 8
 
2.3%
pine 7
 
2.0%
open 6
 
1.7%
of 6
 
1.7%
ponderosa 6
 
1.7%
soil 5
 
1.4%
Other values (132) 216
61.2%
2025-01-14T11:27:39.786156image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
226
 
10.5%
a 205
 
9.5%
n 189
 
8.8%
r 178
 
8.3%
e 162
 
7.5%
o 131
 
6.1%
s 119
 
5.5%
b 116
 
5.4%
i 105
 
4.9%
t 98
 
4.5%
Other values (36) 627
29.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1794
83.2%
Space Separator 226
 
10.5%
Uppercase Letter 100
 
4.6%
Other Punctuation 31
 
1.4%
Decimal Number 3
 
0.1%
Dash Punctuation 2
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 205
11.4%
n 189
10.5%
r 178
9.9%
e 162
 
9.0%
o 131
 
7.3%
s 119
 
6.6%
b 116
 
6.5%
i 105
 
5.9%
t 98
 
5.5%
d 81
 
4.5%
Other values (14) 410
22.9%
Uppercase Letter
ValueCountFrequency (%)
U 50
50.0%
S 19
 
19.0%
P 11
 
11.0%
W 5
 
5.0%
R 3
 
3.0%
B 3
 
3.0%
C 3
 
3.0%
E 2
 
2.0%
V 1
 
1.0%
F 1
 
1.0%
Other values (2) 2
 
2.0%
Other Punctuation
ValueCountFrequency (%)
, 20
64.5%
. 4
 
12.9%
; 3
 
9.7%
" 2
 
6.5%
: 1
 
3.2%
' 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
0 2
66.7%
1 1
33.3%
Space Separator
ValueCountFrequency (%)
226
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1894
87.8%
Common 262
 
12.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 205
10.8%
n 189
 
10.0%
r 178
 
9.4%
e 162
 
8.6%
o 131
 
6.9%
s 119
 
6.3%
b 116
 
6.1%
i 105
 
5.5%
t 98
 
5.2%
d 81
 
4.3%
Other values (26) 510
26.9%
Common
ValueCountFrequency (%)
226
86.3%
, 20
 
7.6%
. 4
 
1.5%
; 3
 
1.1%
" 2
 
0.8%
- 2
 
0.8%
0 2
 
0.8%
: 1
 
0.4%
1 1
 
0.4%
' 1
 
0.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2156
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
226
 
10.5%
a 205
 
9.5%
n 189
 
8.8%
r 178
 
8.3%
e 162
 
7.5%
o 131
 
6.1%
s 119
 
5.5%
b 116
 
5.4%
i 105
 
4.9%
t 98
 
4.5%
Other values (36) 627
29.1%

higherGeography
Text

Missing 

Distinct951
Distinct (%)6.3%
Missing3778
Missing (%)20.0%
Memory size147.5 KiB
2025-01-14T11:27:39.982693image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length74
Median length66
Mean length40.53313892
Min length4

Characters and Unicode

Total characters611564
Distinct characters63
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique314 ?
Unique (%)2.1%

Sample

1st rowNorth America; USA; Connecticut; New Haven County
2nd rowNorth America; USA; Connecticut; Middlesex County
3rd rowNorth America; USA; Connecticut; Middlesex County
4th rowNorth America; USA; Connecticut; Middlesex County
5th rowNorth America; USA; Connecticut; Middlesex County
ValueCountFrequency (%)
america 11919
14.1%
north 11535
13.6%
usa 10091
 
11.9%
county 9449
 
11.1%
new 4323
 
5.1%
hampshire 2881
 
3.4%
carroll 2750
 
3.2%
africa 2011
 
2.4%
connecticut 1497
 
1.8%
province 1319
 
1.6%
Other values (974) 27017
31.9%
2025-01-14T11:27:40.249222image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
69704
 
11.4%
r 45359
 
7.4%
a 42864
 
7.0%
o 40987
 
6.7%
; 38761
 
6.3%
e 35621
 
5.8%
i 34064
 
5.6%
t 32303
 
5.3%
n 28583
 
4.7%
A 26232
 
4.3%
Other values (53) 217086
35.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 397956
65.1%
Uppercase Letter 105028
 
17.2%
Space Separator 69704
 
11.4%
Other Punctuation 38819
 
6.3%
Dash Punctuation 57
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 45359
11.4%
a 42864
10.8%
o 40987
10.3%
e 35621
9.0%
i 34064
8.6%
t 32303
 
8.1%
n 28583
 
7.2%
c 22577
 
5.7%
h 17759
 
4.5%
m 16806
 
4.2%
Other values (20) 81033
20.4%
Uppercase Letter
ValueCountFrequency (%)
A 26232
25.0%
C 17618
16.8%
N 16467
15.7%
S 12399
11.8%
U 10244
 
9.8%
H 3948
 
3.8%
M 2894
 
2.8%
P 2510
 
2.4%
E 1256
 
1.2%
G 1207
 
1.1%
Other values (16) 10253
 
9.8%
Other Punctuation
ValueCountFrequency (%)
; 38761
99.9%
. 28
 
0.1%
' 28
 
0.1%
& 1
 
< 0.1%
, 1
 
< 0.1%
Space Separator
ValueCountFrequency (%)
69704
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 57
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 502984
82.2%
Common 108580
 
17.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 45359
 
9.0%
a 42864
 
8.5%
o 40987
 
8.1%
e 35621
 
7.1%
i 34064
 
6.8%
t 32303
 
6.4%
n 28583
 
5.7%
A 26232
 
5.2%
c 22577
 
4.5%
h 17759
 
3.5%
Other values (46) 176635
35.1%
Common
ValueCountFrequency (%)
69704
64.2%
; 38761
35.7%
- 57
 
0.1%
. 28
 
< 0.1%
' 28
 
< 0.1%
& 1
 
< 0.1%
, 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 611460
> 99.9%
None 104
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
69704
 
11.4%
r 45359
 
7.4%
a 42864
 
7.0%
o 40987
 
6.7%
; 38761
 
6.3%
e 35621
 
5.8%
i 34064
 
5.6%
t 32303
 
5.3%
n 28583
 
4.7%
A 26232
 
4.3%
Other values (48) 216982
35.5%
None
ValueCountFrequency (%)
á 72
69.2%
í 16
 
15.4%
é 11
 
10.6%
ó 4
 
3.8%
Á 1
 
1.0%

continent
Text

Missing 

Distinct6
Distinct (%)< 0.1%
Missing3913
Missing (%)20.7%
Memory size147.5 KiB
2025-01-14T11:27:40.310538image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length13
Median length13
Mean length11.50270849
Min length4

Characters and Unicode

Total characters172000
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNorth America
2nd rowNorth America
3rd rowNorth America
4th rowNorth America
5th rowNorth America
ValueCountFrequency (%)
america 11919
44.4%
north 11399
42.4%
africa 1984
 
7.4%
asia 640
 
2.4%
south 520
 
1.9%
europe 281
 
1.0%
oceania 129
 
0.5%
2025-01-14T11:27:40.414299image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
r 25583
14.9%
a 14801
8.6%
i 14672
8.5%
A 14543
8.5%
c 14032
8.2%
e 12329
7.2%
o 12200
7.1%
t 11919
6.9%
h 11919
6.9%
11919
6.9%
Other values (10) 28083
16.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 133209
77.4%
Uppercase Letter 26872
 
15.6%
Space Separator 11919
 
6.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 25583
19.2%
a 14801
11.1%
i 14672
11.0%
c 14032
10.5%
e 12329
9.3%
o 12200
9.2%
t 11919
8.9%
h 11919
8.9%
m 11919
8.9%
f 1984
 
1.5%
Other values (4) 1851
 
1.4%
Uppercase Letter
ValueCountFrequency (%)
A 14543
54.1%
N 11399
42.4%
S 520
 
1.9%
E 281
 
1.0%
O 129
 
0.5%
Space Separator
ValueCountFrequency (%)
11919
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 160081
93.1%
Common 11919
 
6.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 25583
16.0%
a 14801
9.2%
i 14672
9.2%
A 14543
9.1%
c 14032
8.8%
e 12329
7.7%
o 12200
7.6%
t 11919
7.4%
h 11919
7.4%
m 11919
7.4%
Other values (9) 16164
10.1%
Common
ValueCountFrequency (%)
11919
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 172000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r 25583
14.9%
a 14801
8.6%
i 14672
8.5%
A 14543
8.5%
c 14032
8.2%
e 12329
7.2%
o 12200
7.1%
t 11919
6.9%
h 11919
6.9%
11919
6.9%
Other values (10) 28083
16.3%

waterBody
Text

Missing 

Distinct7
Distinct (%)5.5%
Missing18739
Missing (%)99.3%
Memory size147.5 KiB
2025-01-14T11:27:40.466999image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length38
Median length29
Mean length23.07874016
Min length12

Characters and Unicode

Total characters2931
Distinct characters25
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)2.4%

Sample

1st rowAtlantic Ocean; Caribbean Sea
2nd rowAtlantic Ocean; Caribbean Sea
3rd rowAtlantic Ocean; Caribbean Sea
4th rowAtlantic Ocean; Caribbean Sea
5th rowAtlantic Ocean; Caribbean Sea
ValueCountFrequency (%)
ocean 127
30.5%
atlantic 87
20.9%
sea 79
19.0%
caribbean 78
18.8%
pacific 30
 
7.2%
indian 9
 
2.2%
arctic 1
 
0.2%
red 1
 
0.2%
gulf 1
 
0.2%
of 1
 
0.2%
Other values (2) 2
 
0.5%
2025-01-14T11:27:40.571612image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 490
16.7%
n 312
10.6%
289
9.9%
e 287
9.8%
c 277
9.5%
i 236
8.1%
t 176
 
6.0%
b 156
 
5.3%
O 127
 
4.3%
A 88
 
3.0%
Other values (15) 493
16.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2147
73.3%
Uppercase Letter 415
 
14.2%
Space Separator 289
 
9.9%
Other Punctuation 80
 
2.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 490
22.8%
n 312
14.5%
e 287
13.4%
c 277
12.9%
i 236
11.0%
t 176
 
8.2%
b 156
 
7.3%
l 88
 
4.1%
r 80
 
3.7%
f 32
 
1.5%
Other values (4) 13
 
0.6%
Uppercase Letter
ValueCountFrequency (%)
O 127
30.6%
A 88
21.2%
S 80
19.3%
C 78
18.8%
P 30
 
7.2%
I 9
 
2.2%
R 1
 
0.2%
G 1
 
0.2%
L 1
 
0.2%
Space Separator
ValueCountFrequency (%)
289
100.0%
Other Punctuation
ValueCountFrequency (%)
; 80
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2562
87.4%
Common 369
 
12.6%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 490
19.1%
n 312
12.2%
e 287
11.2%
c 277
10.8%
i 236
9.2%
t 176
 
6.9%
b 156
 
6.1%
O 127
 
5.0%
A 88
 
3.4%
l 88
 
3.4%
Other values (13) 325
12.7%
Common
ValueCountFrequency (%)
289
78.3%
; 80
 
21.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2931
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 490
16.7%
n 312
10.6%
289
9.9%
e 287
9.8%
c 277
9.5%
i 236
8.1%
t 176
 
6.0%
b 156
 
5.3%
O 127
 
4.3%
A 88
 
3.0%
Other values (15) 493
16.8%

country
Text

Missing 

Distinct105
Distinct (%)0.7%
Missing3927
Missing (%)20.8%
Memory size147.5 KiB
2025-01-14T11:27:40.678529image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length28
Median length3
Mean length4.21547627
Min length3

Characters and Unicode

Total characters62975
Distinct characters48
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique17 ?
Unique (%)0.1%

Sample

1st rowUSA
2nd rowUSA
3rd rowUSA
4th rowUSA
5th rowUSA
ValueCountFrequency (%)
usa 10091
66.2%
canada 686
 
4.5%
kenya 667
 
4.4%
mexico 578
 
3.8%
egypt 430
 
2.8%
indonesia 279
 
1.8%
cameroon 254
 
1.7%
ecuador 236
 
1.5%
greece 138
 
0.9%
australia 112
 
0.7%
Other values (112) 1762
 
11.6%
2025-01-14T11:27:40.844066image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
A 10274
16.3%
S 10260
16.3%
U 10116
16.1%
a 5889
9.4%
n 3063
 
4.9%
e 2749
 
4.4%
i 2122
 
3.4%
o 2093
 
3.3%
d 1537
 
2.4%
r 1381
 
2.2%
Other values (38) 13491
21.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 35368
56.2%
Lowercase Letter 27313
43.4%
Space Separator 294
 
0.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 5889
21.6%
n 3063
11.2%
e 2749
10.1%
i 2122
 
7.8%
o 2093
 
7.7%
d 1537
 
5.6%
r 1381
 
5.1%
c 1282
 
4.7%
y 1175
 
4.3%
g 755
 
2.8%
Other values (16) 5267
19.3%
Uppercase Letter
ValueCountFrequency (%)
A 10274
29.0%
S 10260
29.0%
U 10116
28.6%
C 1140
 
3.2%
K 728
 
2.1%
M 702
 
2.0%
E 681
 
1.9%
I 424
 
1.2%
G 224
 
0.6%
B 149
 
0.4%
Other values (11) 670
 
1.9%
Space Separator
ValueCountFrequency (%)
294
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 62681
99.5%
Common 294
 
0.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
A 10274
16.4%
S 10260
16.4%
U 10116
16.1%
a 5889
9.4%
n 3063
 
4.9%
e 2749
 
4.4%
i 2122
 
3.4%
o 2093
 
3.3%
d 1537
 
2.5%
r 1381
 
2.2%
Other values (37) 13197
21.1%
Common
ValueCountFrequency (%)
294
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 62975
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
A 10274
16.3%
S 10260
16.3%
U 10116
16.1%
a 5889
9.4%
n 3063
 
4.9%
e 2749
 
4.4%
i 2122
 
3.4%
o 2093
 
3.3%
d 1537
 
2.4%
r 1381
 
2.2%
Other values (38) 13491
21.4%

stateProvince
Text

Missing 

Distinct260
Distinct (%)1.9%
Missing5347
Missing (%)28.3%
Memory size147.5 KiB
2025-01-14T11:27:41.025151image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length32
Median length25
Mean length11.24032843
Min length3

Characters and Unicode

Total characters151958
Distinct characters58
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)0.5%

Sample

1st rowConnecticut
2nd rowConnecticut
3rd rowConnecticut
4th rowConnecticut
5th rowConnecticut
ValueCountFrequency (%)
new 3586
17.2%
hampshire 2877
 
13.8%
connecticut 1497
 
7.2%
province 1288
 
6.2%
state 613
 
2.9%
minnesota 580
 
2.8%
york 506
 
2.4%
colorado 463
 
2.2%
arizona 438
 
2.1%
wisconsin 425
 
2.0%
Other values (287) 8636
41.3%
2025-01-14T11:27:41.283055image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 14301
 
9.4%
a 13747
 
9.0%
i 12892
 
8.5%
n 10792
 
7.1%
o 10626
 
7.0%
r 9355
 
6.2%
t 8029
 
5.3%
s 7835
 
5.2%
7390
 
4.9%
c 5643
 
3.7%
Other values (48) 51348
33.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 123691
81.4%
Uppercase Letter 20861
 
13.7%
Space Separator 7390
 
4.9%
Dash Punctuation 15
 
< 0.1%
Other Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 14301
11.6%
a 13747
11.1%
i 12892
10.4%
n 10792
8.7%
o 10626
8.6%
r 9355
 
7.6%
t 8029
 
6.5%
s 7835
 
6.3%
c 5643
 
4.6%
h 4465
 
3.6%
Other values (20) 26006
21.0%
Uppercase Letter
ValueCountFrequency (%)
N 4092
19.6%
C 3272
15.7%
H 2932
14.1%
P 1777
8.5%
M 1528
 
7.3%
A 1144
 
5.5%
S 846
 
4.1%
W 788
 
3.8%
V 591
 
2.8%
Y 507
 
2.4%
Other values (15) 3384
16.2%
Space Separator
ValueCountFrequency (%)
7390
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 15
100.0%
Other Punctuation
ValueCountFrequency (%)
' 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 144552
95.1%
Common 7406
 
4.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 14301
 
9.9%
a 13747
 
9.5%
i 12892
 
8.9%
n 10792
 
7.5%
o 10626
 
7.4%
r 9355
 
6.5%
t 8029
 
5.6%
s 7835
 
5.4%
c 5643
 
3.9%
h 4465
 
3.1%
Other values (45) 46867
32.4%
Common
ValueCountFrequency (%)
7390
99.8%
- 15
 
0.2%
' 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 151865
99.9%
None 93
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 14301
 
9.4%
a 13747
 
9.1%
i 12892
 
8.5%
n 10792
 
7.1%
o 10626
 
7.0%
r 9355
 
6.2%
t 8029
 
5.3%
s 7835
 
5.2%
7390
 
4.9%
c 5643
 
3.7%
Other values (44) 51255
33.8%
None
ValueCountFrequency (%)
á 72
77.4%
í 16
 
17.2%
ó 3
 
3.2%
é 2
 
2.2%

county
Text

Missing 

Distinct484
Distinct (%)5.0%
Missing9192
Missing (%)48.7%
Memory size147.5 KiB
2025-01-14T11:27:41.473683image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length28
Median length27
Mean length14.43198263
Min length6

Characters and Unicode

Total characters139615
Distinct characters57
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique154 ?
Unique (%)1.6%

Sample

1st rowNew Haven County
2nd rowMiddlesex County
3rd rowMiddlesex County
4th rowMiddlesex County
5th rowMiddlesex County
ValueCountFrequency (%)
county 9433
45.6%
carroll 2750
 
13.3%
new 705
 
3.4%
haven 655
 
3.2%
cass 356
 
1.7%
litchfield 334
 
1.6%
gunnison 275
 
1.3%
fairfield 220
 
1.1%
iron 203
 
1.0%
middlesex 167
 
0.8%
Other values (517) 5606
27.1%
2025-01-14T11:27:41.728323image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 15785
11.3%
n 14029
10.0%
C 13107
9.4%
t 11232
 
8.0%
11030
 
7.9%
u 10798
 
7.7%
y 9773
 
7.0%
r 8601
 
6.2%
l 7999
 
5.7%
a 7541
 
5.4%
Other values (47) 29720
21.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 107635
77.1%
Uppercase Letter 20854
 
14.9%
Space Separator 11030
 
7.9%
Other Punctuation 55
 
< 0.1%
Dash Punctuation 41
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o 15785
14.7%
n 14029
13.0%
t 11232
10.4%
u 10798
10.0%
y 9773
9.1%
r 8601
8.0%
l 7999
7.4%
a 7541
7.0%
e 5545
 
5.2%
i 3786
 
3.5%
Other values (18) 12546
11.7%
Uppercase Letter
ValueCountFrequency (%)
C 13107
62.9%
H 924
 
4.4%
L 867
 
4.2%
N 832
 
4.0%
S 670
 
3.2%
M 609
 
2.9%
F 535
 
2.6%
G 503
 
2.4%
P 490
 
2.3%
B 456
 
2.2%
Other values (15) 1861
 
8.9%
Other Punctuation
ValueCountFrequency (%)
. 28
50.9%
' 27
49.1%
Space Separator
ValueCountFrequency (%)
11030
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 41
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 128489
92.0%
Common 11126
 
8.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o 15785
12.3%
n 14029
10.9%
C 13107
10.2%
t 11232
8.7%
u 10798
8.4%
y 9773
7.6%
r 8601
 
6.7%
l 7999
 
6.2%
a 7541
 
5.9%
e 5545
 
4.3%
Other values (43) 24079
18.7%
Common
ValueCountFrequency (%)
11030
99.1%
- 41
 
0.4%
. 28
 
0.3%
' 27
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 139604
> 99.9%
None 11
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o 15785
11.3%
n 14029
10.0%
C 13107
9.4%
t 11232
 
8.0%
11030
 
7.9%
u 10798
 
7.7%
y 9773
 
7.0%
r 8601
 
6.2%
l 7999
 
5.7%
a 7541
 
5.4%
Other values (44) 29709
21.3%
None
ValueCountFrequency (%)
é 9
81.8%
Á 1
 
9.1%
ó 1
 
9.1%

municipality
Text

Missing 

Distinct93
Distinct (%)16.7%
Missing18309
Missing (%)97.0%
Memory size147.5 KiB
2025-01-14T11:27:41.836084image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length25
Median length19
Mean length8.47935368
Min length4

Characters and Unicode

Total characters4723
Distinct characters49
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique37 ?
Unique (%)6.6%

Sample

1st rowRedding
2nd rowHamden
3rd rowHamden
4th rowPerkasie
5th rowPhiladelphia
ValueCountFrequency (%)
parksville 56
 
8.5%
fairfield 39
 
5.9%
westport 35
 
5.3%
kent 32
 
4.9%
norwalk 29
 
4.4%
lloyd 27
 
4.1%
harbor 27
 
4.1%
new 25
 
3.8%
quince 24
 
3.6%
mil 24
 
3.6%
Other values (98) 340
51.7%
2025-01-14T11:27:41.991422image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
l 421
 
8.9%
e 410
 
8.7%
a 396
 
8.4%
r 359
 
7.6%
i 356
 
7.5%
o 282
 
6.0%
n 258
 
5.5%
t 205
 
4.3%
s 184
 
3.9%
d 163
 
3.5%
Other values (39) 1689
35.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 3961
83.9%
Uppercase Letter 658
 
13.9%
Space Separator 101
 
2.1%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
l 421
10.6%
e 410
10.4%
a 396
10.0%
r 359
 
9.1%
i 356
 
9.0%
o 282
 
7.1%
n 258
 
6.5%
t 205
 
5.2%
s 184
 
4.6%
d 163
 
4.1%
Other values (13) 927
23.4%
Uppercase Letter
ValueCountFrequency (%)
P 108
16.4%
N 59
9.0%
W 58
 
8.8%
M 55
 
8.4%
H 49
 
7.4%
F 43
 
6.5%
L 41
 
6.2%
K 36
 
5.5%
B 35
 
5.3%
Q 28
 
4.3%
Other values (12) 146
22.2%
Other Punctuation
ValueCountFrequency (%)
, 1
50.0%
& 1
50.0%
Space Separator
ValueCountFrequency (%)
101
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4619
97.8%
Common 104
 
2.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
l 421
 
9.1%
e 410
 
8.9%
a 396
 
8.6%
r 359
 
7.8%
i 356
 
7.7%
o 282
 
6.1%
n 258
 
5.6%
t 205
 
4.4%
s 184
 
4.0%
d 163
 
3.5%
Other values (35) 1585
34.3%
Common
ValueCountFrequency (%)
101
97.1%
, 1
 
1.0%
- 1
 
1.0%
& 1
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4723
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
l 421
 
8.9%
e 410
 
8.7%
a 396
 
8.4%
r 359
 
7.6%
i 356
 
7.5%
o 282
 
6.0%
n 258
 
5.5%
t 205
 
4.3%
s 184
 
3.9%
d 163
 
3.5%
Other values (39) 1689
35.8%

locality
Text

Missing 

Distinct2520
Distinct (%)19.4%
Missing5869
Missing (%)31.1%
Memory size147.5 KiB
2025-01-14T11:27:42.177797image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length136
Median length96
Mean length26.34000154
Min length3

Characters and Unicode

Total characters342341
Distinct characters86
Distinct categories9 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1275 ?
Unique (%)9.8%

Sample

1st rowNew Haven. Yale University, Peabody Museum
2nd rowClinton. 245 Killingworth Turnpike
3rd rowClinton. 245 Killingworth Turnpike
4th rowClinton. 245 Killingworth Turnpike
5th rowClinton. 245 Killingworth Turnpike
ValueCountFrequency (%)
forest 3560
 
6.6%
experimental 2766
 
5.1%
bartlett 2744
 
5.1%
of 2288
 
4.2%
comp 1856
 
3.4%
miles 929
 
1.7%
transect 736
 
1.4%
mi 727
 
1.3%
national 657
 
1.2%
island 533
 
1.0%
Other values (3204) 37538
69.1%
2025-01-14T11:27:42.450286image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
41355
 
12.1%
e 29260
 
8.5%
a 26473
 
7.7%
t 25066
 
7.3%
o 21028
 
6.1%
r 19414
 
5.7%
n 17045
 
5.0%
i 15970
 
4.7%
l 15843
 
4.6%
s 12163
 
3.6%
Other values (76) 118724
34.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 240976
70.4%
Space Separator 41355
 
12.1%
Uppercase Letter 38660
 
11.3%
Decimal Number 10790
 
3.2%
Other Punctuation 9066
 
2.6%
Dash Punctuation 863
 
0.3%
Open Punctuation 261
 
0.1%
Close Punctuation 261
 
0.1%
Math Symbol 109
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 29260
12.1%
a 26473
11.0%
t 25066
10.4%
o 21028
8.7%
r 19414
8.1%
n 17045
 
7.1%
i 15970
 
6.6%
l 15843
 
6.6%
s 12163
 
5.0%
m 10319
 
4.3%
Other values (20) 48395
20.1%
Uppercase Letter
ValueCountFrequency (%)
F 4122
10.7%
C 4107
10.6%
B 4072
10.5%
E 3542
 
9.2%
S 2650
 
6.9%
M 2630
 
6.8%
N 2411
 
6.2%
R 2164
 
5.6%
P 1557
 
4.0%
A 1375
 
3.6%
Other values (16) 10030
25.9%
Other Punctuation
ValueCountFrequency (%)
. 6402
70.6%
, 1832
 
20.2%
/ 501
 
5.5%
' 120
 
1.3%
? 66
 
0.7%
; 48
 
0.5%
" 40
 
0.4%
& 31
 
0.3%
: 22
 
0.2%
# 4
 
< 0.1%
Decimal Number
ValueCountFrequency (%)
1 2029
18.8%
5 1908
17.7%
4 1565
14.5%
0 1214
11.3%
3 942
8.7%
6 915
8.5%
2 913
8.5%
7 511
 
4.7%
8 475
 
4.4%
9 318
 
2.9%
Close Punctuation
ValueCountFrequency (%)
] 212
81.2%
) 48
 
18.4%
} 1
 
0.4%
Math Symbol
ValueCountFrequency (%)
= 105
96.3%
~ 2
 
1.8%
+ 2
 
1.8%
Open Punctuation
ValueCountFrequency (%)
[ 213
81.6%
( 48
 
18.4%
Space Separator
ValueCountFrequency (%)
41355
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 863
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 279636
81.7%
Common 62705
 
18.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 29260
 
10.5%
a 26473
 
9.5%
t 25066
 
9.0%
o 21028
 
7.5%
r 19414
 
6.9%
n 17045
 
6.1%
i 15970
 
5.7%
l 15843
 
5.7%
s 12163
 
4.3%
m 10319
 
3.7%
Other values (46) 87055
31.1%
Common
ValueCountFrequency (%)
41355
66.0%
. 6402
 
10.2%
1 2029
 
3.2%
5 1908
 
3.0%
, 1832
 
2.9%
4 1565
 
2.5%
0 1214
 
1.9%
3 942
 
1.5%
6 915
 
1.5%
2 913
 
1.5%
Other values (20) 3630
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 342324
> 99.9%
None 17
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
41355
 
12.1%
e 29260
 
8.5%
a 26473
 
7.7%
t 25066
 
7.3%
o 21028
 
6.1%
r 19414
 
5.7%
n 17045
 
5.0%
i 15970
 
4.7%
l 15843
 
4.6%
s 12163
 
3.6%
Other values (72) 118707
34.7%
None
ValueCountFrequency (%)
í 8
47.1%
ç 4
23.5%
ö 4
23.5%
á 1
 
5.9%
Distinct155
Distinct (%)10.5%
Missing17391
Missing (%)92.2%
Memory size147.5 KiB
2025-01-14T11:27:42.603626image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.444067797
Min length1

Characters and Unicode

Total characters5080
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique39 ?
Unique (%)2.6%

Sample

1st row61
2nd row61
3rd row638
4th row638
5th row1143
ValueCountFrequency (%)
1829 124
 
8.4%
61 104
 
7.1%
2896 60
 
4.1%
2134 59
 
4.0%
700 59
 
4.0%
638 56
 
3.8%
1000 53
 
3.6%
500 42
 
2.8%
1402 29
 
2.0%
1280 29
 
2.0%
Other values (145) 860
58.3%
2025-01-14T11:27:42.814389image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 934
18.4%
0 927
18.2%
2 635
12.5%
8 506
10.0%
6 441
8.7%
9 373
 
7.3%
3 369
 
7.3%
7 307
 
6.0%
4 294
 
5.8%
5 294
 
5.8%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5080
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 934
18.4%
0 927
18.2%
2 635
12.5%
8 506
10.0%
6 441
8.7%
9 373
 
7.3%
3 369
 
7.3%
7 307
 
6.0%
4 294
 
5.8%
5 294
 
5.8%

Most occurring scripts

ValueCountFrequency (%)
Common 5080
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 934
18.4%
0 927
18.2%
2 635
12.5%
8 506
10.0%
6 441
8.7%
9 373
 
7.3%
3 369
 
7.3%
7 307
 
6.0%
4 294
 
5.8%
5 294
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5080
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 934
18.4%
0 927
18.2%
2 635
12.5%
8 506
10.0%
6 441
8.7%
9 373
 
7.3%
3 369
 
7.3%
7 307
 
6.0%
4 294
 
5.8%
5 294
 
5.8%
Distinct110
Distinct (%)14.0%
Missing18082
Missing (%)95.8%
Memory size147.5 KiB
2025-01-14T11:27:42.941302image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length3.341836735
Min length1

Characters and Unicode

Total characters2620
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique29 ?
Unique (%)3.7%

Sample

1st row61
2nd row61
3rd row61
4th row1829
5th row61
ValueCountFrequency (%)
61 104
 
13.3%
1829 42
 
5.4%
1000 31
 
4.0%
1402 29
 
3.7%
1280 28
 
3.6%
2896 27
 
3.4%
91 27
 
3.4%
30 24
 
3.1%
2743 23
 
2.9%
1585 18
 
2.3%
Other values (100) 431
55.0%
2025-01-14T11:27:43.121583image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 597
22.8%
0 472
18.0%
2 332
12.7%
6 250
9.5%
8 204
 
7.8%
9 192
 
7.3%
5 186
 
7.1%
3 145
 
5.5%
4 134
 
5.1%
7 108
 
4.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2620
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 597
22.8%
0 472
18.0%
2 332
12.7%
6 250
9.5%
8 204
 
7.8%
9 192
 
7.3%
5 186
 
7.1%
3 145
 
5.5%
4 134
 
5.1%
7 108
 
4.1%

Most occurring scripts

ValueCountFrequency (%)
Common 2620
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 597
22.8%
0 472
18.0%
2 332
12.7%
6 250
9.5%
8 204
 
7.8%
9 192
 
7.3%
5 186
 
7.1%
3 145
 
5.5%
4 134
 
5.1%
7 108
 
4.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 2620
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 597
22.8%
0 472
18.0%
2 332
12.7%
6 250
9.5%
8 204
 
7.8%
9 192
 
7.3%
5 186
 
7.1%
3 145
 
5.5%
4 134
 
5.1%
7 108
 
4.1%

verbatimElevation
Text

Missing 

Distinct195
Distinct (%)13.2%
Missing17391
Missing (%)92.2%
Memory size147.5 KiB
2025-01-14T11:27:43.355432image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length14
Median length12
Mean length8.446779661
Min length4

Characters and Unicode

Total characters12459
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique57 ?
Unique (%)3.9%

Sample

1st row200-200 ft
2nd row200-200 ft
3rd row638 m
4th row638 m
5th row1143 m
ValueCountFrequency (%)
m 858
29.1%
ft 617
20.9%
200-200 104
 
3.5%
1829 84
 
2.8%
700 58
 
2.0%
638 56
 
1.9%
2134 56
 
1.9%
6000-6000 40
 
1.4%
500 39
 
1.3%
2896 33
 
1.1%
Other values (172) 1005
34.1%
2025-01-14T11:27:43.526251image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3601
28.9%
1475
11.8%
m 858
 
6.9%
- 784
 
6.3%
2 739
 
5.9%
1 683
 
5.5%
f 617
 
5.0%
t 617
 
5.0%
8 490
 
3.9%
4 490
 
3.9%
Other values (5) 2105
16.9%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 8108
65.1%
Lowercase Letter 2092
 
16.8%
Space Separator 1475
 
11.8%
Dash Punctuation 784
 
6.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 3601
44.4%
2 739
 
9.1%
1 683
 
8.4%
8 490
 
6.0%
4 490
 
6.0%
5 468
 
5.8%
3 458
 
5.6%
6 449
 
5.5%
9 375
 
4.6%
7 355
 
4.4%
Lowercase Letter
ValueCountFrequency (%)
m 858
41.0%
f 617
29.5%
t 617
29.5%
Space Separator
ValueCountFrequency (%)
1475
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 784
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 10367
83.2%
Latin 2092
 
16.8%

Most frequent character per script

Common
ValueCountFrequency (%)
0 3601
34.7%
1475
14.2%
- 784
 
7.6%
2 739
 
7.1%
1 683
 
6.6%
8 490
 
4.7%
4 490
 
4.7%
5 468
 
4.5%
3 458
 
4.4%
6 449
 
4.3%
Other values (2) 730
 
7.0%
Latin
ValueCountFrequency (%)
m 858
41.0%
f 617
29.5%
t 617
29.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 12459
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3601
28.9%
1475
11.8%
m 858
 
6.9%
- 784
 
6.3%
2 739
 
5.9%
1 683
 
5.5%
f 617
 
5.0%
t 617
 
5.0%
8 490
 
3.9%
4 490
 
3.9%
Other values (5) 2105
16.9%

decimalLatitude
Text

Missing 

Distinct2258
Distinct (%)16.9%
Missing5543
Missing (%)29.4%
Memory size147.5 KiB
2025-01-14T11:27:43.729249image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length15
Median length13
Mean length7.821736846
Min length1

Characters and Unicode

Total characters104209
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1064 ?
Unique (%)8.0%

Sample

1st row41.358888859069
2nd row41.358888859069
3rd row40.2804715
4th row39.9660548
5th row40.2804715
ValueCountFrequency (%)
44.049466 311
 
2.3%
44.059277 252
 
1.9%
44.062155 245
 
1.8%
3.9167 244
 
1.8%
44.061185 228
 
1.7%
44.041766 222
 
1.7%
44.059944 204
 
1.5%
44.050880 202
 
1.5%
41.3931 147
 
1.1%
41.3081 130
 
1.0%
Other values (2222) 11138
83.6%
2025-01-14T11:27:44.005113image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
4 15858
15.2%
. 13291
12.8%
3 11118
10.7%
0 11016
10.6%
1 8934
8.6%
6 8422
8.1%
5 7759
7.4%
7 6947
6.7%
2 6946
6.7%
8 6900
6.6%
Other values (2) 7018
6.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 89524
85.9%
Other Punctuation 13291
 
12.8%
Dash Punctuation 1394
 
1.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 15858
17.7%
3 11118
12.4%
0 11016
12.3%
1 8934
10.0%
6 8422
9.4%
5 7759
8.7%
7 6947
7.8%
2 6946
7.8%
8 6900
7.7%
9 5624
 
6.3%
Other Punctuation
ValueCountFrequency (%)
. 13291
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1394
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 104209
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 15858
15.2%
. 13291
12.8%
3 11118
10.7%
0 11016
10.6%
1 8934
8.6%
6 8422
8.1%
5 7759
7.4%
7 6947
6.7%
2 6946
6.7%
8 6900
6.6%
Other values (2) 7018
6.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 104209
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 15858
15.2%
. 13291
12.8%
3 11118
10.7%
0 11016
10.6%
1 8934
8.6%
6 8422
8.1%
5 7759
7.4%
7 6947
6.7%
2 6946
6.7%
8 6900
6.6%
Other values (2) 7018
6.7%

decimalLongitude
Text

Missing 

Distinct2296
Distinct (%)17.2%
Missing5543
Missing (%)29.4%
Memory size147.5 KiB
2025-01-14T11:27:44.219028image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length15
Median length11
Mean length8.931772123
Min length1

Characters and Unicode

Total characters118998
Distinct characters12
Distinct categories3 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1103 ?
Unique (%)8.3%

Sample

1st row-72.90380733456
2nd row-72.90380733456
3rd row-75.0506836
4th row-75.1956828
5th row-75.0506836
ValueCountFrequency (%)
71.273830 311
 
2.3%
71.304611 252
 
1.9%
71.297795 245
 
1.8%
136.1667 244
 
1.8%
71.307927 232
 
1.7%
71.303074 228
 
1.7%
71.319924 222
 
1.7%
71.308122 204
 
1.5%
71.2903479 160
 
1.2%
72.8972 148
 
1.1%
Other values (2281) 11077
83.1%
2025-01-14T11:27:44.495150image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 14237
12.0%
7 13643
11.5%
. 13252
11.1%
3 11911
10.0%
0 11591
9.7%
- 11234
9.4%
2 8871
7.5%
6 8250
6.9%
9 7707
6.5%
8 7413
6.2%
Other values (2) 10889
9.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 94512
79.4%
Other Punctuation 13252
 
11.1%
Dash Punctuation 11234
 
9.4%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 14237
15.1%
7 13643
14.4%
3 11911
12.6%
0 11591
12.3%
2 8871
9.4%
6 8250
8.7%
9 7707
8.2%
8 7413
7.8%
5 5580
 
5.9%
4 5309
 
5.6%
Other Punctuation
ValueCountFrequency (%)
. 13252
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11234
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 118998
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 14237
12.0%
7 13643
11.5%
. 13252
11.1%
3 11911
10.0%
0 11591
9.7%
- 11234
9.4%
2 8871
7.5%
6 8250
6.9%
9 7707
6.5%
8 7413
6.2%
Other values (2) 10889
9.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 118998
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 14237
12.0%
7 13643
11.5%
. 13252
11.1%
3 11911
10.0%
0 11591
9.7%
- 11234
9.4%
2 8871
7.5%
6 8250
6.9%
9 7707
6.5%
8 7413
6.2%
Other values (2) 10889
9.2%

geodeticDatum
Text

Missing 

Distinct2
Distinct (%)< 0.1%
Missing5666
Missing (%)30.0%
Memory size147.5 KiB
2025-01-14T11:27:44.553234image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length5
Median length5
Mean length5
Min length5

Characters and Unicode

Total characters66000
Distinct characters10
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowWGS84
2nd rowWGS84
3rd rowWGS84
4th rowWGS84
5th rowWGS84
ValueCountFrequency (%)
wgs84 12952
98.1%
nad27 248
 
1.9%
2025-01-14T11:27:44.649927image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
W 12952
19.6%
G 12952
19.6%
S 12952
19.6%
8 12952
19.6%
4 12952
19.6%
N 248
 
0.4%
A 248
 
0.4%
D 248
 
0.4%
2 248
 
0.4%
7 248
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 39600
60.0%
Decimal Number 26400
40.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
W 12952
32.7%
G 12952
32.7%
S 12952
32.7%
N 248
 
0.6%
A 248
 
0.6%
D 248
 
0.6%
Decimal Number
ValueCountFrequency (%)
8 12952
49.1%
4 12952
49.1%
2 248
 
0.9%
7 248
 
0.9%

Most occurring scripts

ValueCountFrequency (%)
Latin 39600
60.0%
Common 26400
40.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
W 12952
32.7%
G 12952
32.7%
S 12952
32.7%
N 248
 
0.6%
A 248
 
0.6%
D 248
 
0.6%
Common
ValueCountFrequency (%)
8 12952
49.1%
4 12952
49.1%
2 248
 
0.9%
7 248
 
0.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII 66000
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
W 12952
19.6%
G 12952
19.6%
S 12952
19.6%
8 12952
19.6%
4 12952
19.6%
N 248
 
0.4%
A 248
 
0.4%
D 248
 
0.4%
2 248
 
0.4%
7 248
 
0.4%
Distinct476
Distinct (%)3.6%
Missing5609
Missing (%)29.7%
Memory size147.5 KiB
2025-01-14T11:27:44.794569image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length7
Median length4
Mean length4.104246813
Min length2

Characters and Unicode

Total characters54410
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique228 ?
Unique (%)1.7%

Sample

1st row5359
2nd row5359
3rd row5359
4th row5359
5th row5359
ValueCountFrequency (%)
1850 5476
41.3%
1851 4930
37.2%
111111 329
 
2.5%
3036 110
 
0.8%
1583 104
 
0.8%
301 96
 
0.7%
103733 86
 
0.6%
5000 84
 
0.6%
300 79
 
0.6%
500 66
 
0.5%
Other values (466) 1897
 
14.3%
2025-01-14T11:27:45.007137image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 19011
34.9%
5 11398
20.9%
8 11362
20.9%
0 7286
 
13.4%
3 1449
 
2.7%
4 978
 
1.8%
7 822
 
1.5%
6 746
 
1.4%
2 681
 
1.3%
9 673
 
1.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 54406
> 99.9%
Other Punctuation 4
 
< 0.1%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
1 19011
34.9%
5 11398
20.9%
8 11362
20.9%
0 7286
 
13.4%
3 1449
 
2.7%
4 978
 
1.8%
7 822
 
1.5%
6 746
 
1.4%
2 681
 
1.3%
9 673
 
1.2%
Other Punctuation
ValueCountFrequency (%)
. 4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 54410
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
1 19011
34.9%
5 11398
20.9%
8 11362
20.9%
0 7286
 
13.4%
3 1449
 
2.7%
4 978
 
1.8%
7 822
 
1.5%
6 746
 
1.4%
2 681
 
1.3%
9 673
 
1.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 54410
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 19011
34.9%
5 11398
20.9%
8 11362
20.9%
0 7286
 
13.4%
3 1449
 
2.7%
4 978
 
1.8%
7 822
 
1.5%
6 746
 
1.4%
2 681
 
1.3%
9 673
 
1.2%

georeferencedBy
Text

Missing 

Distinct14
Distinct (%)4.3%
Missing18537
Missing (%)98.3%
Memory size147.5 KiB
2025-01-14T11:27:45.083698image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length26
Median length17
Mean length17.73860182
Min length13

Characters and Unicode

Total characters5836
Distinct characters42
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)1.5%

Sample

1st rowPiper L. Stepule
2nd rowPiper L. Stepule
3rd rowPeter A. Capainolo
4th rowKristof Zyskowski
5th rowNicholas J. Kerhoulas
ValueCountFrequency (%)
kristof 233
31.6%
zyskowski 233
31.6%
j 37
 
5.0%
gregory 24
 
3.3%
watkins-colwell 24
 
3.3%
peter 22
 
3.0%
a 22
 
3.0%
capainolo 22
 
3.0%
dornburg 14
 
1.9%
alex 14
 
1.9%
Other values (26) 93
 
12.6%
2025-01-14T11:27:45.217396image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 761
13.0%
o 607
 
10.4%
i 545
 
9.3%
k 497
 
8.5%
409
 
7.0%
r 364
 
6.2%
t 294
 
5.0%
y 269
 
4.6%
w 263
 
4.5%
K 251
 
4.3%
Other values (32) 1576
27.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4560
78.1%
Uppercase Letter 763
 
13.1%
Space Separator 409
 
7.0%
Other Punctuation 80
 
1.4%
Dash Punctuation 24
 
0.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 761
16.7%
o 607
13.3%
i 545
12.0%
k 497
10.9%
r 364
8.0%
t 294
 
6.4%
y 269
 
5.9%
w 263
 
5.8%
f 234
 
5.1%
e 163
 
3.6%
Other values (13) 563
12.3%
Uppercase Letter
ValueCountFrequency (%)
K 251
32.9%
Z 233
30.5%
C 49
 
6.4%
J 43
 
5.6%
A 39
 
5.1%
P 33
 
4.3%
W 30
 
3.9%
G 24
 
3.1%
D 17
 
2.2%
S 13
 
1.7%
Other values (6) 31
 
4.1%
Space Separator
ValueCountFrequency (%)
409
100.0%
Other Punctuation
ValueCountFrequency (%)
. 80
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 24
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5323
91.2%
Common 513
 
8.8%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 761
14.3%
o 607
11.4%
i 545
10.2%
k 497
9.3%
r 364
 
6.8%
t 294
 
5.5%
y 269
 
5.1%
w 263
 
4.9%
K 251
 
4.7%
f 234
 
4.4%
Other values (29) 1238
23.3%
Common
ValueCountFrequency (%)
409
79.7%
. 80
 
15.6%
- 24
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5836
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 761
13.0%
o 607
 
10.4%
i 545
 
9.3%
k 497
 
8.5%
409
 
7.0%
r 364
 
6.2%
t 294
 
5.0%
y 269
 
4.6%
w 263
 
4.5%
K 251
 
4.3%
Other values (32) 1576
27.0%

georeferencedDate
Text

Missing 

Distinct48
Distinct (%)0.6%
Missing10549
Missing (%)55.9%
Memory size147.5 KiB
2025-01-14T11:27:45.280430image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length10
Mean length9.131417578
Min length4

Characters and Unicode

Total characters75946
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique16 ?
Unique (%)0.2%

Sample

1st row2015
2nd row2015
3rd row2015
4th row2015
5th row2015
ValueCountFrequency (%)
2023-12-28 5807
69.8%
2015 1204
 
14.5%
2020-06-14 935
 
11.2%
2020-12-30 124
 
1.5%
2023-12-03 45
 
0.5%
2021-12-08 27
 
0.3%
2024-01-17 18
 
0.2%
2024-05-01 17
 
0.2%
2019-11-04 16
 
0.2%
2022-06-18 14
 
0.2%
Other values (38) 110
 
1.3%
2025-01-14T11:27:45.401255image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2 27323
36.0%
- 14226
18.7%
0 10723
 
14.1%
1 8422
 
11.1%
3 6079
 
8.0%
8 5869
 
7.7%
5 1236
 
1.6%
4 1028
 
1.4%
6 994
 
1.3%
7 24
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 61720
81.3%
Dash Punctuation 14226
 
18.7%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 27323
44.3%
0 10723
 
17.4%
1 8422
 
13.6%
3 6079
 
9.8%
8 5869
 
9.5%
5 1236
 
2.0%
4 1028
 
1.7%
6 994
 
1.6%
7 24
 
< 0.1%
9 22
 
< 0.1%
Dash Punctuation
ValueCountFrequency (%)
- 14226
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 75946
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 27323
36.0%
- 14226
18.7%
0 10723
 
14.1%
1 8422
 
11.1%
3 6079
 
8.0%
8 5869
 
7.7%
5 1236
 
1.6%
4 1028
 
1.4%
6 994
 
1.3%
7 24
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75946
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 27323
36.0%
- 14226
18.7%
0 10723
 
14.1%
1 8422
 
11.1%
3 6079
 
8.0%
8 5869
 
7.7%
5 1236
 
1.6%
4 1028
 
1.4%
6 994
 
1.3%
7 24
 
< 0.1%

georeferenceProtocol
Text

Missing 

Distinct3
Distinct (%)< 0.1%
Missing5610
Missing (%)29.7%
Memory size147.5 KiB
2025-01-14T11:27:45.452496image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length16
Mean length13.75980688
Min length11

Characters and Unicode

Total characters182400
Distinct characters18
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowdigital resource
2nd rowdigital resource
3rd rowdigital resource
4th rowdigital resource
5th rowdigital resource
ValueCountFrequency (%)
resource 7300
35.5%
digital 7216
35.1%
unspecified 5956
29.0%
physical 84
 
0.4%
2025-01-14T11:27:45.562493image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 26512
14.5%
i 26428
14.5%
r 14600
 
8.0%
s 13340
 
7.3%
c 13340
 
7.3%
u 13256
 
7.3%
d 13172
 
7.2%
7300
 
4.0%
l 7300
 
4.0%
a 7300
 
4.0%
Other values (8) 39852
21.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 175100
96.0%
Space Separator 7300
 
4.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 26512
15.1%
i 26428
15.1%
r 14600
8.3%
s 13340
7.6%
c 13340
7.6%
u 13256
7.6%
d 13172
7.5%
l 7300
 
4.2%
a 7300
 
4.2%
o 7300
 
4.2%
Other values (7) 32552
18.6%
Space Separator
ValueCountFrequency (%)
7300
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 175100
96.0%
Common 7300
 
4.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 26512
15.1%
i 26428
15.1%
r 14600
8.3%
s 13340
7.6%
c 13340
7.6%
u 13256
7.6%
d 13172
7.5%
l 7300
 
4.2%
a 7300
 
4.2%
o 7300
 
4.2%
Other values (7) 32552
18.6%
Common
ValueCountFrequency (%)
7300
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 182400
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 26512
14.5%
i 26428
14.5%
r 14600
 
8.0%
s 13340
 
7.3%
c 13340
 
7.3%
u 13256
 
7.3%
d 13172
 
7.2%
7300
 
4.0%
l 7300
 
4.0%
a 7300
 
4.0%
Other values (8) 39852
21.8%

georeferenceSources
Text

Missing 

Distinct14
Distinct (%)0.1%
Missing5615
Missing (%)29.8%
Memory size147.5 KiB
2025-01-14T11:27:45.624461image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length21
Median length15
Mean length9.898347295
Min length4

Characters and Unicode

Total characters131163
Distinct characters42
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNEVP
2nd rowNEVP
3rd rowNEVP
4th rowNEVP
5th rowNEVP
ValueCountFrequency (%)
unspecified 5957
31.8%
unit 3838
20.5%
gps 3838
20.5%
geolocate 1254
 
6.7%
google 785
 
4.2%
earth 713
 
3.8%
vertnet 649
 
3.5%
2014 290
 
1.5%
census 290
 
1.5%
tiger 290
 
1.5%
Other values (11) 847
 
4.5%
2025-01-14T11:27:45.750149image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 16291
12.4%
e 15708
12.0%
n 10145
 
7.7%
u 10138
 
7.7%
c 7413
 
5.7%
t 7162
 
5.5%
s 6614
 
5.0%
p 6243
 
4.8%
G 6167
 
4.7%
d 6099
 
4.6%
Other values (32) 39183
29.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 101255
77.2%
Uppercase Letter 23104
 
17.6%
Space Separator 5500
 
4.2%
Decimal Number 1160
 
0.9%
Other Punctuation 144
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 16291
16.1%
e 15708
15.5%
n 10145
10.0%
u 10138
10.0%
c 7413
7.3%
t 7162
7.1%
s 6614
6.5%
p 6243
 
6.2%
d 6099
 
6.0%
f 5957
 
5.9%
Other values (10) 9485
9.4%
Uppercase Letter
ValueCountFrequency (%)
G 6167
26.7%
S 4128
17.9%
P 4106
17.8%
E 2531
11.0%
L 1254
 
5.4%
O 1254
 
5.4%
N 923
 
4.0%
V 917
 
4.0%
T 296
 
1.3%
C 290
 
1.3%
Other values (6) 1238
 
5.4%
Decimal Number
ValueCountFrequency (%)
4 290
25.0%
1 290
25.0%
0 290
25.0%
2 290
25.0%
Space Separator
ValueCountFrequency (%)
5500
100.0%
Other Punctuation
ValueCountFrequency (%)
. 144
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 124359
94.8%
Common 6804
 
5.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 16291
13.1%
e 15708
12.6%
n 10145
 
8.2%
u 10138
 
8.2%
c 7413
 
6.0%
t 7162
 
5.8%
s 6614
 
5.3%
p 6243
 
5.0%
G 6167
 
5.0%
d 6099
 
4.9%
Other values (26) 32379
26.0%
Common
ValueCountFrequency (%)
5500
80.8%
4 290
 
4.3%
1 290
 
4.3%
0 290
 
4.3%
2 290
 
4.3%
. 144
 
2.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 131163
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 16291
12.4%
e 15708
12.0%
n 10145
 
7.7%
u 10138
 
7.7%
c 7413
 
5.7%
t 7162
 
5.5%
s 6614
 
5.0%
p 6243
 
4.8%
G 6167
 
4.7%
d 6099
 
4.6%
Other values (32) 39183
29.9%

georeferenceRemarks
Text

Missing 

Distinct562
Distinct (%)4.3%
Missing5661
Missing (%)30.0%
Memory size147.5 KiB
2025-01-14T11:27:45.932706image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length570
Median length446
Mean length102.1251799
Min length8

Characters and Unicode

Total characters1348563
Distinct characters84
Distinct categories11 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique291 ?
Unique (%)2.2%

Sample

1st rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
2nd rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
3rd rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
4th rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
5th rowprovisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG
ValueCountFrequency (%)
for 11797
 
5.4%
km 11604
 
5.3%
radius 10782
 
5.0%
georeference 7659
 
3.5%
to 6875
 
3.2%
by 5881
 
2.7%
was 5876
 
2.7%
that 5847
 
2.7%
only 5832
 
2.7%
ex 5813
 
2.7%
Other values (1631) 139388
64.1%
2025-01-14T11:27:46.194489image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
204184
15.1%
e 140668
 
10.4%
r 102754
 
7.6%
i 71247
 
5.3%
o 67231
 
5.0%
s 56371
 
4.2%
a 55673
 
4.1%
n 55507
 
4.1%
t 49761
 
3.7%
d 47386
 
3.5%
Other values (74) 497781
36.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 936316
69.4%
Space Separator 204186
 
15.1%
Decimal Number 112539
 
8.3%
Uppercase Letter 62764
 
4.7%
Other Punctuation 22473
 
1.7%
Dash Punctuation 9955
 
0.7%
Open Punctuation 131
 
< 0.1%
Close Punctuation 131
 
< 0.1%
Math Symbol 66
 
< 0.1%
Modifier Symbol 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 140668
15.0%
r 102754
11.0%
i 71247
 
7.6%
o 67231
 
7.2%
s 56371
 
6.0%
a 55673
 
5.9%
n 55507
 
5.9%
t 49761
 
5.3%
d 47386
 
5.1%
c 38921
 
4.2%
Other values (16) 250797
26.8%
Uppercase Letter
ValueCountFrequency (%)
S 12725
20.3%
F 7867
12.5%
M 7849
12.5%
A 7349
11.7%
D 5962
9.5%
G 2949
 
4.7%
C 2406
 
3.8%
L 2224
 
3.5%
O 2065
 
3.3%
N 1923
 
3.1%
Other values (16) 9445
15.0%
Decimal Number
ValueCountFrequency (%)
1 37695
33.5%
0 27546
24.5%
2 17351
15.4%
9 12223
 
10.9%
4 11178
 
9.9%
5 2315
 
2.1%
6 1714
 
1.5%
8 1016
 
0.9%
3 924
 
0.8%
7 577
 
0.5%
Other Punctuation
ValueCountFrequency (%)
, 8935
39.8%
. 6543
29.1%
: 3957
17.6%
/ 2222
 
9.9%
; 391
 
1.7%
' 293
 
1.3%
" 60
 
0.3%
& 48
 
0.2%
? 16
 
0.1%
% 8
 
< 0.1%
Math Symbol
ValueCountFrequency (%)
= 43
65.2%
+ 20
30.3%
~ 3
 
4.5%
Space Separator
ValueCountFrequency (%)
204184
> 99.9%
  2
 
< 0.1%
Open Punctuation
ValueCountFrequency (%)
( 118
90.1%
[ 13
 
9.9%
Close Punctuation
ValueCountFrequency (%)
) 118
90.1%
] 13
 
9.9%
Dash Punctuation
ValueCountFrequency (%)
- 9955
100.0%
Modifier Symbol
ValueCountFrequency (%)
` 1
100.0%
Currency Symbol
ValueCountFrequency (%)
¤ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 999080
74.1%
Common 349483
 
25.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 140668
14.1%
r 102754
 
10.3%
i 71247
 
7.1%
o 67231
 
6.7%
s 56371
 
5.6%
a 55673
 
5.6%
n 55507
 
5.6%
t 49761
 
5.0%
d 47386
 
4.7%
c 38921
 
3.9%
Other values (42) 313561
31.4%
Common
ValueCountFrequency (%)
204184
58.4%
1 37695
 
10.8%
0 27546
 
7.9%
2 17351
 
5.0%
9 12223
 
3.5%
4 11178
 
3.2%
- 9955
 
2.8%
, 8935
 
2.6%
. 6543
 
1.9%
: 3957
 
1.1%
Other values (22) 9916
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1348560
> 99.9%
None 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
204184
15.1%
e 140668
 
10.4%
r 102754
 
7.6%
i 71247
 
5.3%
o 67231
 
5.0%
s 56371
 
4.2%
a 55673
 
4.1%
n 55507
 
4.1%
t 49761
 
3.7%
d 47386
 
3.5%
Other values (72) 497778
36.9%
None
ValueCountFrequency (%)
  2
66.7%
¤ 1
33.3%

typeStatus
Text

Missing 

Distinct5
Distinct (%)22.7%
Missing18844
Missing (%)99.9%
Memory size147.5 KiB
2025-01-14T11:27:46.252891image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length10
Median length8
Mean length8.090909091
Min length8

Characters and Unicode

Total characters178
Distinct characters11
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)9.1%

Sample

1st rowhypotype
2nd rowparatype
3rd rowhypotype
4th rowhypotype
5th rowhypotype
ValueCountFrequency (%)
hypotype 13
59.1%
paratype 5
 
22.7%
topotype 2
 
9.1%
plesiotype 1
 
4.5%
holotype 1
 
4.5%
2025-01-14T11:27:46.361412image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
p 43
24.2%
y 35
19.7%
t 24
13.5%
e 23
12.9%
o 20
11.2%
h 14
 
7.9%
a 10
 
5.6%
r 5
 
2.8%
l 2
 
1.1%
s 1
 
0.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 178
100.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
p 43
24.2%
y 35
19.7%
t 24
13.5%
e 23
12.9%
o 20
11.2%
h 14
 
7.9%
a 10
 
5.6%
r 5
 
2.8%
l 2
 
1.1%
s 1
 
0.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 178
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
p 43
24.2%
y 35
19.7%
t 24
13.5%
e 23
12.9%
o 20
11.2%
h 14
 
7.9%
a 10
 
5.6%
r 5
 
2.8%
l 2
 
1.1%
s 1
 
0.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII 178
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
p 43
24.2%
y 35
19.7%
t 24
13.5%
e 23
12.9%
o 20
11.2%
h 14
 
7.9%
a 10
 
5.6%
r 5
 
2.8%
l 2
 
1.1%
s 1
 
0.6%

identifiedBy
Text

Missing 

Distinct46
Distinct (%)4.1%
Missing17735
Missing (%)94.0%
Memory size147.5 KiB
2025-01-14T11:27:46.462559image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length26
Median length21
Mean length15.7020336
Min length6

Characters and Unicode

Total characters17759
Distinct characters50
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique10 ?
Unique (%)0.9%

Sample

1st rowGary P. Aronsen
2nd rowGary P. Aronsen
3rd rowJosé A. Ottenwalder
4th rowAngus J. Mossman
5th rowAngus J. Mossman
ValueCountFrequency (%)
jordan 278
 
8.9%
colosi 278
 
8.9%
g 278
 
8.9%
a 247
 
7.9%
mary 240
 
7.7%
turner 240
 
7.7%
kristof 101
 
3.2%
zyskowski 101
 
3.2%
alex 100
 
3.2%
dornburg 100
 
3.2%
Other values (91) 1159
37.1%
2025-01-14T11:27:46.638323image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1991
 
11.2%
r 1773
 
10.0%
o 1434
 
8.1%
n 1105
 
6.2%
a 1041
 
5.9%
e 976
 
5.5%
s 880
 
5.0%
i 864
 
4.9%
. 854
 
4.8%
l 730
 
4.1%
Other values (40) 6111
34.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 11749
66.2%
Uppercase Letter 3145
 
17.7%
Space Separator 1991
 
11.2%
Other Punctuation 854
 
4.8%
Dash Punctuation 20
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r 1773
15.1%
o 1434
12.2%
n 1105
9.4%
a 1041
8.9%
e 976
8.3%
s 880
7.5%
i 864
7.4%
l 730
 
6.2%
d 462
 
3.9%
y 415
 
3.5%
Other values (15) 2069
17.6%
Uppercase Letter
ValueCountFrequency (%)
A 467
14.8%
J 384
12.2%
C 371
11.8%
M 341
10.8%
G 309
9.8%
K 263
8.4%
T 244
7.8%
D 107
 
3.4%
N 103
 
3.3%
Z 101
 
3.2%
Other values (12) 455
14.5%
Space Separator
ValueCountFrequency (%)
1991
100.0%
Other Punctuation
ValueCountFrequency (%)
. 854
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 20
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 14894
83.9%
Common 2865
 
16.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
r 1773
 
11.9%
o 1434
 
9.6%
n 1105
 
7.4%
a 1041
 
7.0%
e 976
 
6.6%
s 880
 
5.9%
i 864
 
5.8%
l 730
 
4.9%
A 467
 
3.1%
d 462
 
3.1%
Other values (37) 5162
34.7%
Common
ValueCountFrequency (%)
1991
69.5%
. 854
29.8%
- 20
 
0.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII 17758
> 99.9%
None 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1991
 
11.2%
r 1773
 
10.0%
o 1434
 
8.1%
n 1105
 
6.2%
a 1041
 
5.9%
e 976
 
5.5%
s 880
 
5.0%
i 864
 
4.9%
. 854
 
4.8%
l 730
 
4.1%
Other values (39) 6110
34.4%
None
ValueCountFrequency (%)
é 1
100.0%

dateIdentified
Text

Missing 

Distinct26
Distinct (%)2.7%
Missing17913
Missing (%)94.9%
Memory size147.5 KiB
2025-01-14T11:27:46.701265image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters3812
Distinct characters10
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)0.5%

Sample

1st row2016
2nd row2016
3rd row1985
4th row2016
5th row2016
ValueCountFrequency (%)
2008 271
28.4%
2009 257
27.0%
2007 130
13.6%
2012 126
13.2%
2016 26
 
2.7%
2011 22
 
2.3%
2020 22
 
2.3%
2010 22
 
2.3%
2024 18
 
1.9%
2023 15
 
1.6%
Other values (16) 44
 
4.6%
2025-01-14T11:27:46.817190image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 1656
43.4%
2 1137
29.8%
8 276
 
7.2%
9 274
 
7.2%
1 250
 
6.6%
7 132
 
3.5%
6 33
 
0.9%
4 25
 
0.7%
3 21
 
0.6%
5 8
 
0.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 3812
100.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 1656
43.4%
2 1137
29.8%
8 276
 
7.2%
9 274
 
7.2%
1 250
 
6.6%
7 132
 
3.5%
6 33
 
0.9%
4 25
 
0.7%
3 21
 
0.6%
5 8
 
0.2%

Most occurring scripts

ValueCountFrequency (%)
Common 3812
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 1656
43.4%
2 1137
29.8%
8 276
 
7.2%
9 274
 
7.2%
1 250
 
6.6%
7 132
 
3.5%
6 33
 
0.9%
4 25
 
0.7%
3 21
 
0.6%
5 8
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3812
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1656
43.4%
2 1137
29.8%
8 276
 
7.2%
9 274
 
7.2%
1 250
 
6.6%
7 132
 
3.5%
6 33
 
0.9%
4 25
 
0.7%
3 21
 
0.6%
5 8
 
0.2%

identificationRemarks
Text

Missing 

Distinct3
Distinct (%)100.0%
Missing18863
Missing (%)> 99.9%
Memory size147.5 KiB
2025-01-14T11:27:46.879061image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length57
Median length6
Mean length22.66666667
Min length5

Characters and Unicode

Total characters68
Distinct characters22
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)100.0%

Sample

1st rowreferenced on page 89 in the descripton of Agouti thomasi
2nd rowEaton
3rd rowThorpe
ValueCountFrequency (%)
referenced 1
8.3%
on 1
8.3%
page 1
8.3%
89 1
8.3%
in 1
8.3%
the 1
8.3%
descripton 1
8.3%
of 1
8.3%
agouti 1
8.3%
thomasi 1
8.3%
Other values (2) 2
16.7%
2025-01-14T11:27:46.991569image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
9
13.2%
e 8
11.8%
o 7
10.3%
n 5
 
7.4%
t 5
 
7.4%
r 4
 
5.9%
i 4
 
5.9%
h 3
 
4.4%
a 3
 
4.4%
p 3
 
4.4%
Other values (12) 17
25.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 54
79.4%
Space Separator 9
 
13.2%
Uppercase Letter 3
 
4.4%
Decimal Number 2
 
2.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 8
14.8%
o 7
13.0%
n 5
9.3%
t 5
9.3%
r 4
 
7.4%
i 4
 
7.4%
h 3
 
5.6%
a 3
 
5.6%
p 3
 
5.6%
g 2
 
3.7%
Other values (6) 10
18.5%
Uppercase Letter
ValueCountFrequency (%)
E 1
33.3%
A 1
33.3%
T 1
33.3%
Decimal Number
ValueCountFrequency (%)
8 1
50.0%
9 1
50.0%
Space Separator
ValueCountFrequency (%)
9
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 57
83.8%
Common 11
 
16.2%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 8
14.0%
o 7
12.3%
n 5
 
8.8%
t 5
 
8.8%
r 4
 
7.0%
i 4
 
7.0%
h 3
 
5.3%
a 3
 
5.3%
p 3
 
5.3%
g 2
 
3.5%
Other values (9) 13
22.8%
Common
ValueCountFrequency (%)
9
81.8%
8 1
 
9.1%
9 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 68
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
9
13.2%
e 8
11.8%
o 7
10.3%
n 5
 
7.4%
t 5
 
7.4%
r 4
 
5.9%
i 4
 
5.9%
h 3
 
4.4%
a 3
 
4.4%
p 3
 
4.4%
Other values (12) 17
25.0%
Distinct2018
Distinct (%)10.7%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:47.179758image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length43
Median length34
Mean length22.09201739
Min length3

Characters and Unicode

Total characters416788
Distinct characters53
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique703 ?
Unique (%)3.7%

Sample

1st rowTamias striatus fisheri
2nd rowPeromyscus leucopus noveboracensis
3rd rowPeromyscus leucopus noveboracensis
4th rowPeromyscus leucopus noveboracensis
5th rowPeromyscus leucopus noveboracensis
ValueCountFrequency (%)
peromyscus 1837
 
4.0%
cinereus 1489
 
3.2%
sorex 1193
 
2.6%
brevicauda 1125
 
2.4%
blarina 976
 
2.1%
zibethicus 898
 
2.0%
talpoides 868
 
1.9%
gapperi 848
 
1.8%
maniculatus 829
 
1.8%
leucopus 782
 
1.7%
Other values (2070) 35113
76.4%
2025-01-14T11:27:47.461035image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 41623
 
10.0%
i 36625
 
8.8%
a 35093
 
8.4%
u 30890
 
7.4%
e 30381
 
7.3%
27092
 
6.5%
r 26522
 
6.4%
o 25267
 
6.1%
n 22452
 
5.4%
c 20781
 
5.0%
Other values (43) 120062
28.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 370969
89.0%
Space Separator 27092
 
6.5%
Uppercase Letter 18716
 
4.5%
Other Punctuation 9
 
< 0.1%
Dash Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 41623
11.2%
i 36625
9.9%
a 35093
9.5%
u 30890
 
8.3%
e 30381
 
8.2%
r 26522
 
7.1%
o 25267
 
6.8%
n 22452
 
6.1%
c 20781
 
5.6%
l 16432
 
4.4%
Other values (16) 84903
22.9%
Uppercase Letter
ValueCountFrequency (%)
P 3107
16.6%
C 2505
13.4%
S 1952
10.4%
M 1925
10.3%
B 1452
7.8%
O 1312
7.0%
T 1217
 
6.5%
N 831
 
4.4%
L 676
 
3.6%
A 598
 
3.2%
Other values (13) 3141
16.8%
Other Punctuation
ValueCountFrequency (%)
. 7
77.8%
? 2
 
22.2%
Space Separator
ValueCountFrequency (%)
27092
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 389685
93.5%
Common 27103
 
6.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 41623
10.7%
i 36625
 
9.4%
a 35093
 
9.0%
u 30890
 
7.9%
e 30381
 
7.8%
r 26522
 
6.8%
o 25267
 
6.5%
n 22452
 
5.8%
c 20781
 
5.3%
l 16432
 
4.2%
Other values (39) 103619
26.6%
Common
ValueCountFrequency (%)
27092
> 99.9%
. 7
 
< 0.1%
? 2
 
< 0.1%
- 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 416788
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 41623
 
10.0%
i 36625
 
8.8%
a 35093
 
8.4%
u 30890
 
7.4%
e 30381
 
7.3%
27092
 
6.5%
r 26522
 
6.4%
o 25267
 
6.1%
n 22452
 
5.4%
c 20781
 
5.0%
Other values (43) 120062
28.8%
Distinct256
Distinct (%)1.4%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:47.626805image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length231
Median length222
Mean length176.5778336
Min length30

Characters and Unicode

Total characters3304301
Distinct characters50
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)0.1%

Sample

1st rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Sciuromorpha; Sciurida; Sciuridae; Xerinae
2nd rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
3rd rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
4th rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
5th rowAnimalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae
ValueCountFrequency (%)
animalia 18713
 
8.8%
vertebrata 18713
 
8.8%
chordata 18713
 
8.8%
amniota 18711
 
8.8%
mammalia 18711
 
8.8%
theriiformes-----theria-placentalia-epitheria 15223
 
7.1%
rodentia 8426
 
3.9%
preptotheria-anagalida-simplicidentata 8425
 
3.9%
myomorpha 5919
 
2.8%
myodonta 5717
 
2.7%
Other values (374) 76277
35.7%
2025-01-14T11:27:47.869575image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 452709
13.7%
i 335054
 
10.1%
e 250563
 
7.6%
r 228161
 
6.9%
t 207108
 
6.3%
; 194835
 
5.9%
194835
 
5.9%
o 167342
 
5.1%
- 154910
 
4.7%
n 124331
 
3.8%
Other values (40) 994453
30.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2464975
74.6%
Uppercase Letter 294746
 
8.9%
Other Punctuation 194835
 
5.9%
Space Separator 194835
 
5.9%
Dash Punctuation 154910
 
4.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 452709
18.4%
i 335054
13.6%
e 250563
10.2%
r 228161
9.3%
t 207108
8.4%
o 167342
 
6.8%
n 124331
 
5.0%
m 121823
 
4.9%
l 112560
 
4.6%
h 109691
 
4.4%
Other values (14) 355633
14.4%
Uppercase Letter
ValueCountFrequency (%)
A 54733
18.6%
M 41057
13.9%
T 37832
12.8%
P 36866
12.5%
C 34091
11.6%
E 20860
 
7.1%
S 20718
 
7.0%
V 19658
 
6.7%
R 9908
 
3.4%
F 3662
 
1.2%
Other values (13) 15361
 
5.2%
Other Punctuation
ValueCountFrequency (%)
; 194835
100.0%
Space Separator
ValueCountFrequency (%)
194835
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 154910
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2759721
83.5%
Common 544580
 
16.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 452709
16.4%
i 335054
12.1%
e 250563
 
9.1%
r 228161
 
8.3%
t 207108
 
7.5%
o 167342
 
6.1%
n 124331
 
4.5%
m 121823
 
4.4%
l 112560
 
4.1%
h 109691
 
4.0%
Other values (37) 650379
23.6%
Common
ValueCountFrequency (%)
; 194835
35.8%
194835
35.8%
- 154910
28.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3304301
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 452709
13.7%
i 335054
 
10.1%
e 250563
 
7.6%
r 228161
 
6.9%
t 207108
 
6.3%
; 194835
 
5.9%
194835
 
5.9%
o 167342
 
5.1%
- 154910
 
4.7%
n 124331
 
3.8%
Other values (40) 994453
30.1%

kingdom
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:47.918401image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters149704
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAnimalia
2nd rowAnimalia
3rd rowAnimalia
4th rowAnimalia
5th rowAnimalia
ValueCountFrequency (%)
animalia 18713
100.0%
2025-01-14T11:27:48.013854image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 37426
25.0%
a 37426
25.0%
A 18713
12.5%
n 18713
12.5%
m 18713
12.5%
l 18713
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 130991
87.5%
Uppercase Letter 18713
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 37426
28.6%
a 37426
28.6%
n 18713
14.3%
m 18713
14.3%
l 18713
14.3%
Uppercase Letter
ValueCountFrequency (%)
A 18713
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 149704
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 37426
25.0%
a 37426
25.0%
A 18713
12.5%
n 18713
12.5%
m 18713
12.5%
l 18713
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149704
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 37426
25.0%
a 37426
25.0%
A 18713
12.5%
n 18713
12.5%
m 18713
12.5%
l 18713
12.5%

phylum
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:48.055949image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters149704
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowChordata
2nd rowChordata
3rd rowChordata
4th rowChordata
5th rowChordata
ValueCountFrequency (%)
chordata 18713
100.0%
2025-01-14T11:27:48.152926image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 37426
25.0%
C 18713
12.5%
h 18713
12.5%
o 18713
12.5%
r 18713
12.5%
d 18713
12.5%
t 18713
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 130991
87.5%
Uppercase Letter 18713
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 37426
28.6%
h 18713
14.3%
o 18713
14.3%
r 18713
14.3%
d 18713
14.3%
t 18713
14.3%
Uppercase Letter
ValueCountFrequency (%)
C 18713
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 149704
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 37426
25.0%
C 18713
12.5%
h 18713
12.5%
o 18713
12.5%
r 18713
12.5%
d 18713
12.5%
t 18713
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149704
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 37426
25.0%
C 18713
12.5%
h 18713
12.5%
o 18713
12.5%
r 18713
12.5%
d 18713
12.5%
t 18713
12.5%

class
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing155
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:48.195499image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters149688
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMammalia
2nd rowMammalia
3rd rowMammalia
4th rowMammalia
5th rowMammalia
ValueCountFrequency (%)
mammalia 18711
100.0%
2025-01-14T11:27:48.296330image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 56133
37.5%
m 37422
25.0%
M 18711
 
12.5%
l 18711
 
12.5%
i 18711
 
12.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 130977
87.5%
Uppercase Letter 18711
 
12.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 56133
42.9%
m 37422
28.6%
l 18711
 
14.3%
i 18711
 
14.3%
Uppercase Letter
ValueCountFrequency (%)
M 18711
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 149688
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 56133
37.5%
m 37422
25.0%
M 18711
 
12.5%
l 18711
 
12.5%
i 18711
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII 149688
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 56133
37.5%
m 37422
25.0%
M 18711
 
12.5%
l 18711
 
12.5%
i 18711
 
12.5%

order
Text

Missing 

Distinct29
Distinct (%)0.2%
Missing401
Missing (%)2.1%
Memory size147.5 KiB
2025-01-14T11:27:48.360258image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length16
Median length8
Mean length9.418846466
Min length4

Characters and Unicode

Total characters173919
Distinct characters32
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRodentia
2nd rowRodentia
3rd rowRodentia
4th rowRodentia
5th rowRodentia
ValueCountFrequency (%)
rodentia 8426
45.6%
eulipotyphla 2517
 
13.6%
carnivora 2371
 
12.8%
artiodactyla 1530
 
8.3%
chiroptera 1102
 
6.0%
primates 953
 
5.2%
lagomorpha 348
 
1.9%
diprotodontia 248
 
1.3%
didelphimorphia 213
 
1.2%
perissodactyla 157
 
0.9%
Other values (19) 600
 
3.2%
2025-01-14T11:27:48.483663image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 23290
13.4%
i 18724
10.8%
o 18258
10.5%
t 17054
9.8%
e 11398
 
6.6%
n 11252
 
6.5%
r 10808
 
6.2%
d 10797
 
6.2%
R 8426
 
4.8%
l 7208
 
4.1%
Other values (22) 36704
21.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 155454
89.4%
Uppercase Letter 18465
 
10.6%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 23290
15.0%
i 18724
12.0%
o 18258
11.7%
t 17054
11.0%
e 11398
7.3%
n 11252
7.2%
r 10808
7.0%
d 10797
6.9%
l 7208
 
4.6%
p 7203
 
4.6%
Other values (10) 19462
12.5%
Uppercase Letter
ValueCountFrequency (%)
R 8426
45.6%
C 3681
19.9%
E 2517
 
13.6%
A 1589
 
8.6%
P 1254
 
6.8%
D 495
 
2.7%
L 348
 
1.9%
M 89
 
0.5%
S 31
 
0.2%
H 29
 
0.2%
Other values (2) 6
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 173919
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 23290
13.4%
i 18724
10.8%
o 18258
10.5%
t 17054
9.8%
e 11398
 
6.6%
n 11252
 
6.5%
r 10808
 
6.2%
d 10797
 
6.2%
R 8426
 
4.8%
l 7208
 
4.1%
Other values (22) 36704
21.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 173919
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 23290
13.4%
i 18724
10.8%
o 18258
10.5%
t 17054
9.8%
e 11398
 
6.6%
n 11252
 
6.5%
r 10808
 
6.2%
d 10797
 
6.2%
R 8426
 
4.8%
l 7208
 
4.1%
Other values (22) 36704
21.1%

family
Text

Missing 

Distinct130
Distinct (%)0.7%
Missing838
Missing (%)4.4%
Memory size147.5 KiB
2025-01-14T11:27:48.617534image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length16
Mean length9.660749945
Min length6

Characters and Unicode

Total characters174164
Distinct characters44
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique9 ?
Unique (%)< 0.1%

Sample

1st rowSciuridae
2nd rowCricetidae
3rd rowCricetidae
4th rowCricetidae
5th rowCricetidae
ValueCountFrequency (%)
cricetidae 4134
22.9%
soricidae 2286
12.7%
sciuridae 1673
 
9.3%
muridae 1073
 
6.0%
bovidae 840
 
4.7%
canidae 662
 
3.7%
mustelidae 501
 
2.8%
dipodidae 458
 
2.5%
cercopithecidae 421
 
2.3%
vespertilionidae 407
 
2.3%
Other values (120) 5573
30.9%
2025-01-14T11:27:48.813704image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 29146
16.7%
e 27525
15.8%
a 20548
11.8%
d 19366
11.1%
r 13299
7.6%
c 10222
 
5.9%
o 8596
 
4.9%
t 7181
 
4.1%
C 5980
 
3.4%
S 4069
 
2.3%
Other values (34) 28232
16.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 156136
89.6%
Uppercase Letter 18028
 
10.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 29146
18.7%
e 27525
17.6%
a 20548
13.2%
d 19366
12.4%
r 13299
8.5%
c 10222
 
6.5%
o 8596
 
5.5%
t 7181
 
4.6%
u 3703
 
2.4%
l 3119
 
2.0%
Other values (13) 13431
8.6%
Uppercase Letter
ValueCountFrequency (%)
C 5980
33.2%
S 4069
22.6%
M 1908
 
10.6%
P 1156
 
6.4%
B 866
 
4.8%
D 865
 
4.8%
V 485
 
2.7%
H 481
 
2.7%
L 448
 
2.5%
F 393
 
2.2%
Other values (11) 1377
 
7.6%

Most occurring scripts

ValueCountFrequency (%)
Latin 174164
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 29146
16.7%
e 27525
15.8%
a 20548
11.8%
d 19366
11.1%
r 13299
7.6%
c 10222
 
5.9%
o 8596
 
4.9%
t 7181
 
4.1%
C 5980
 
3.4%
S 4069
 
2.3%
Other values (34) 28232
16.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 174164
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 29146
16.7%
e 27525
15.8%
a 20548
11.8%
d 19366
11.1%
r 13299
7.6%
c 10222
 
5.9%
o 8596
 
4.9%
t 7181
 
4.1%
C 5980
 
3.4%
S 4069
 
2.3%
Other values (34) 28232
16.2%

genus
Text

Missing 

Distinct610
Distinct (%)3.5%
Missing1196
Missing (%)6.3%
Memory size147.5 KiB
2025-01-14T11:27:49.128040image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length17
Median length14
Mean length8.099717035
Min length3

Characters and Unicode

Total characters143122
Distinct characters47
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique106 ?
Unique (%)0.6%

Sample

1st rowTamias
2nd rowPeromyscus
3rd rowPeromyscus
4th rowPeromyscus
5th rowPeromyscus
ValueCountFrequency (%)
peromyscus 1837
 
10.4%
sorex 1193
 
6.8%
blarina 976
 
5.5%
clethrionomys 742
 
4.2%
ondatra 631
 
3.6%
microtus 435
 
2.5%
tamias 398
 
2.3%
napaeozapus 365
 
2.1%
canis 345
 
2.0%
procyon 329
 
1.9%
Other values (600) 10419
59.0%
2025-01-14T11:27:49.391458image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 15060
 
10.5%
o 13424
 
9.4%
a 11837
 
8.3%
r 11272
 
7.9%
e 9347
 
6.5%
u 9121
 
6.4%
i 8228
 
5.7%
c 6244
 
4.4%
y 5923
 
4.1%
l 5700
 
4.0%
Other values (37) 46966
32.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 125452
87.7%
Uppercase Letter 17670
 
12.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 15060
12.0%
o 13424
10.7%
a 11837
9.4%
r 11272
 
9.0%
e 9347
 
7.5%
u 9121
 
7.3%
i 8228
 
6.6%
c 6244
 
5.0%
y 5923
 
4.7%
l 5700
 
4.5%
Other values (14) 29296
23.4%
Uppercase Letter
ValueCountFrequency (%)
P 3034
17.2%
C 2309
13.1%
S 1909
10.8%
M 1595
9.0%
O 1307
7.4%
T 1213
 
6.9%
B 1189
 
6.7%
N 830
 
4.7%
L 663
 
3.8%
A 585
 
3.3%
Other values (13) 3036
17.2%

Most occurring scripts

ValueCountFrequency (%)
Latin 143122
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 15060
 
10.5%
o 13424
 
9.4%
a 11837
 
8.3%
r 11272
 
7.9%
e 9347
 
6.5%
u 9121
 
6.4%
i 8228
 
5.7%
c 6244
 
4.4%
y 5923
 
4.1%
l 5700
 
4.0%
Other values (37) 46966
32.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 143122
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 15060
 
10.5%
o 13424
 
9.4%
a 11837
 
8.3%
r 11272
 
7.9%
e 9347
 
6.5%
u 9121
 
6.4%
i 8228
 
5.7%
c 6244
 
4.4%
y 5923
 
4.1%
l 5700
 
4.0%
Other values (37) 46966
32.8%

specificEpithet
Text

Missing 

Distinct954
Distinct (%)5.8%
Missing2296
Missing (%)12.2%
Memory size147.5 KiB
2025-01-14T11:27:49.595193image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length22
Median length16
Mean length8.552082076
Min length2

Characters and Unicode

Total characters141708
Distinct characters27
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique231 ?
Unique (%)1.4%

Sample

1st rowstriatus
2nd rowleucopus
3rd rowleucopus
4th rowleucopus
5th rowleucopus
ValueCountFrequency (%)
brevicauda 987
 
6.0%
leucopus 775
 
4.7%
cinereus 746
 
4.5%
gapperi 708
 
4.3%
maniculatus 683
 
4.1%
zibethicus 631
 
3.8%
insignis 365
 
2.2%
lotor 328
 
2.0%
canadensis 320
 
1.9%
hudsonicus 292
 
1.8%
Other values (942) 10743
64.8%
2025-01-14T11:27:49.867092image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
i 16262
11.5%
s 15024
10.6%
u 14751
10.4%
a 13768
9.7%
e 10570
 
7.5%
n 9249
 
6.5%
r 9097
 
6.4%
c 8731
 
6.2%
l 6248
 
4.4%
t 5924
 
4.2%
Other values (17) 32084
22.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 141700
> 99.9%
Space Separator 8
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 16262
11.5%
s 15024
10.6%
u 14751
10.4%
a 13768
9.7%
e 10570
 
7.5%
n 9249
 
6.5%
r 9097
 
6.4%
c 8731
 
6.2%
l 6248
 
4.4%
t 5924
 
4.2%
Other values (16) 32076
22.6%
Space Separator
ValueCountFrequency (%)
8
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 141700
> 99.9%
Common 8
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i 16262
11.5%
s 15024
10.6%
u 14751
10.4%
a 13768
9.7%
e 10570
 
7.5%
n 9249
 
6.5%
r 9097
 
6.4%
c 8731
 
6.2%
l 6248
 
4.4%
t 5924
 
4.2%
Other values (16) 32076
22.6%
Common
ValueCountFrequency (%)
8
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 141708
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i 16262
11.5%
s 15024
10.6%
u 14751
10.4%
a 13768
9.7%
e 10570
 
7.5%
n 9249
 
6.5%
r 9097
 
6.4%
c 8731
 
6.2%
l 6248
 
4.4%
t 5924
 
4.2%
Other values (17) 32084
22.6%

infraspecificEpithet
Text

Missing 

Distinct755
Distinct (%)7.3%
Missing8470
Missing (%)44.9%
Memory size147.5 KiB
2025-01-14T11:27:50.033228image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length16
Median length14
Mean length9.011735283
Min length3

Characters and Unicode

Total characters93686
Distinct characters27
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique255 ?
Unique (%)2.5%

Sample

1st rowfisheri
2nd rownoveboracensis
3rd rownoveboracensis
4th rownoveboracensis
5th rownoveboracensis
ValueCountFrequency (%)
talpoides 835
 
8.0%
cinereus 743
 
7.1%
noveboracensis 678
 
6.5%
insignis 368
 
3.5%
ochraceus 358
 
3.4%
pennsylvanicus 303
 
2.9%
fumeus 275
 
2.6%
zibethicus 267
 
2.6%
gracilis 226
 
2.2%
domesticus 193
 
1.9%
Other values (745) 6154
59.2%
2025-01-14T11:27:50.258177image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
s 11487
12.3%
i 10927
11.7%
e 8921
9.5%
a 7771
8.3%
n 7368
 
7.9%
u 6877
 
7.3%
c 5749
 
6.1%
r 5693
 
6.1%
o 5382
 
5.7%
l 4160
 
4.4%
Other values (17) 19351
20.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 93682
> 99.9%
Space Separator 4
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
s 11487
12.3%
i 10927
11.7%
e 8921
9.5%
a 7771
8.3%
n 7368
 
7.9%
u 6877
 
7.3%
c 5749
 
6.1%
r 5693
 
6.1%
o 5382
 
5.7%
l 4160
 
4.4%
Other values (16) 19347
20.7%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 93682
> 99.9%
Common 4
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
s 11487
12.3%
i 10927
11.7%
e 8921
9.5%
a 7771
8.3%
n 7368
 
7.9%
u 6877
 
7.3%
c 5749
 
6.1%
r 5693
 
6.1%
o 5382
 
5.7%
l 4160
 
4.4%
Other values (16) 19347
20.7%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 93686
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
s 11487
12.3%
i 10927
11.7%
e 8921
9.5%
a 7771
8.3%
n 7368
 
7.9%
u 6877
 
7.3%
c 5749
 
6.1%
r 5693
 
6.1%
o 5382
 
5.7%
l 4160
 
4.4%
Other values (17) 19351
20.7%
Distinct12
Distinct (%)0.1%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:50.319303image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length11
Median length10
Mean length8.513119222
Min length5

Characters and Unicode

Total characters159306
Distinct characters24
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)< 0.1%

Sample

1st rowSubspecies
2nd rowSubspecies
3rd rowSubspecies
4th rowSubspecies
5th rowSubspecies
ValueCountFrequency (%)
subspecies 10397
55.6%
species 6125
32.7%
genus 1098
 
5.9%
family 507
 
2.7%
class 246
 
1.3%
order 142
 
0.8%
superfamily 117
 
0.6%
subfamily 49
 
0.3%
suborder 26
 
0.1%
infraorder 3
 
< 0.1%
Other values (2) 3
 
< 0.1%
2025-01-14T11:27:50.431041image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
e 34431
21.6%
s 28509
17.9%
i 17196
10.8%
S 16716
10.5%
p 16641
10.4%
c 16522
10.4%
u 11691
 
7.3%
b 10475
 
6.6%
n 1101
 
0.7%
G 1098
 
0.7%
Other values (14) 4926
 
3.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 140593
88.3%
Uppercase Letter 18713
 
11.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 34431
24.5%
s 28509
20.3%
i 17196
12.2%
p 16641
11.8%
c 16522
11.8%
u 11691
 
8.3%
b 10475
 
7.5%
n 1101
 
0.8%
a 922
 
0.7%
l 921
 
0.7%
Other values (7) 2184
 
1.6%
Uppercase Letter
ValueCountFrequency (%)
S 16716
89.3%
G 1098
 
5.9%
F 507
 
2.7%
C 246
 
1.3%
O 142
 
0.8%
I 3
 
< 0.1%
T 1
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
Latin 159306
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 34431
21.6%
s 28509
17.9%
i 17196
10.8%
S 16716
10.5%
p 16641
10.4%
c 16522
10.4%
u 11691
 
7.3%
b 10475
 
6.6%
n 1101
 
0.7%
G 1098
 
0.7%
Other values (14) 4926
 
3.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 159306
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
e 34431
21.6%
s 28509
17.9%
i 17196
10.8%
S 16716
10.5%
p 16641
10.4%
c 16522
10.4%
u 11691
 
7.3%
b 10475
 
6.6%
n 1101
 
0.7%
G 1098
 
0.7%
Other values (14) 4926
 
3.1%
Distinct1069
Distinct (%)5.8%
Missing385
Missing (%)2.0%
Memory size147.5 KiB
2025-01-14T11:27:50.618390image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length93
Median length52
Mean length14.2923002
Min length4

Characters and Unicode

Total characters264136
Distinct characters74
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique314 ?
Unique (%)1.7%

Sample

1st rowHowell, 1925
2nd row(Fischer, 1829)
3rd row(Fischer, 1829)
4th row(Fischer, 1829)
5th row(Fischer, 1829)
ValueCountFrequency (%)
linnaeus 2832
 
7.1%
1758 2192
 
5.5%
miller 1126
 
2.8%
1830 1009
 
2.5%
1792 1008
 
2.5%
kerr 992
 
2.5%
gapper 835
 
2.1%
1766 748
 
1.9%
fischer 739
 
1.8%
1829 716
 
1.8%
Other values (587) 27903
69.6%
2025-01-14T11:27:50.875453image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 22049
 
8.3%
21619
 
8.2%
, 18697
 
7.1%
e 16160
 
6.1%
8 14713
 
5.6%
r 12026
 
4.6%
n 11112
 
4.2%
a 10637
 
4.0%
( 10300
 
3.9%
) 10300
 
3.9%
Other values (64) 116523
44.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 107537
40.7%
Decimal Number 73876
28.0%
Space Separator 21619
 
8.2%
Uppercase Letter 20554
 
7.8%
Other Punctuation 19866
 
7.5%
Open Punctuation 10300
 
3.9%
Close Punctuation 10300
 
3.9%
Dash Punctuation 84
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 16160
15.0%
r 12026
11.2%
n 11112
10.3%
a 10637
9.9%
i 8633
8.0%
s 7811
 
7.3%
l 7125
 
6.6%
o 5410
 
5.0%
u 4875
 
4.5%
h 3392
 
3.2%
Other values (21) 20356
18.9%
Uppercase Letter
ValueCountFrequency (%)
L 3638
17.7%
G 2465
12.0%
M 2080
10.1%
B 1711
 
8.3%
S 1498
 
7.3%
K 1273
 
6.2%
F 943
 
4.6%
C 827
 
4.0%
O 808
 
3.9%
R 753
 
3.7%
Other values (15) 4558
22.2%
Decimal Number
ValueCountFrequency (%)
1 22049
29.8%
8 14713
19.9%
7 8249
 
11.2%
9 7426
 
10.1%
5 5148
 
7.0%
2 4789
 
6.5%
0 3422
 
4.6%
3 3333
 
4.5%
6 2765
 
3.7%
4 1982
 
2.7%
Other Punctuation
ValueCountFrequency (%)
, 18697
94.1%
. 754
 
3.8%
& 409
 
2.1%
' 6
 
< 0.1%
Space Separator
ValueCountFrequency (%)
21619
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10300
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10300
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 84
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 136045
51.5%
Latin 128091
48.5%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 16160
 
12.6%
r 12026
 
9.4%
n 11112
 
8.7%
a 10637
 
8.3%
i 8633
 
6.7%
s 7811
 
6.1%
l 7125
 
5.6%
o 5410
 
4.2%
u 4875
 
3.8%
L 3638
 
2.8%
Other values (46) 40664
31.7%
Common
ValueCountFrequency (%)
1 22049
16.2%
21619
15.9%
, 18697
13.7%
8 14713
10.8%
( 10300
7.6%
) 10300
7.6%
7 8249
 
6.1%
9 7426
 
5.5%
5 5148
 
3.8%
2 4789
 
3.5%
Other values (8) 12755
9.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII 264000
99.9%
None 136
 
0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 22049
 
8.4%
21619
 
8.2%
, 18697
 
7.1%
e 16160
 
6.1%
8 14713
 
5.6%
r 12026
 
4.6%
n 11112
 
4.2%
a 10637
 
4.0%
( 10300
 
3.9%
) 10300
 
3.9%
Other values (58) 116387
44.1%
None
ValueCountFrequency (%)
ü 83
61.0%
è 27
 
19.9%
ö 12
 
8.8%
ä 7
 
5.1%
É 5
 
3.7%
é 2
 
1.5%
Distinct1166
Distinct (%)6.2%
Missing153
Missing (%)0.8%
Memory size147.5 KiB
2025-01-14T11:27:51.071802image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length143
Median length121
Mean length82.7007428
Min length31

Characters and Unicode

Total characters1547579
Distinct characters60
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique294 ?
Unique (%)1.6%

Sample

1st rowEastern Chipmunk; chipmunks; squirrels; rodents; mammals; vertebrates; chordates; animals
2nd rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
3rd rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
4th rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
5th rowWhite-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals
ValueCountFrequency (%)
mammals 18748
 
11.1%
vertebrates 18713
 
11.1%
chordates 18713
 
11.1%
animals 18713
 
11.1%
rodents 8561
 
5.1%
mice 7296
 
4.3%
carnivores 4733
 
2.8%
shrews 3336
 
2.0%
mouse 2787
 
1.7%
squirrels 2585
 
1.5%
Other values (1028) 64018
38.1%
2025-01-14T11:27:51.343827image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 152163
 
9.8%
e 149527
 
9.7%
149490
 
9.7%
s 133715
 
8.6%
; 118068
 
7.6%
r 116349
 
7.5%
t 95576
 
6.2%
m 94542
 
6.1%
o 69125
 
4.5%
l 60718
 
3.9%
Other values (50) 408306
26.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 1230864
79.5%
Space Separator 149490
 
9.7%
Other Punctuation 118585
 
7.7%
Uppercase Letter 39691
 
2.6%
Dash Punctuation 8820
 
0.6%
Final Punctuation 128
 
< 0.1%
Control 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 152163
12.4%
e 149527
12.1%
s 133715
10.9%
r 116349
9.5%
t 95576
 
7.8%
m 94542
 
7.7%
o 69125
 
5.6%
l 60718
 
4.9%
i 59895
 
4.9%
n 53167
 
4.3%
Other values (17) 246087
20.0%
Uppercase Letter
ValueCountFrequency (%)
S 6525
16.4%
M 5563
14.0%
W 3144
 
7.9%
R 2708
 
6.8%
B 2683
 
6.8%
A 2449
 
6.2%
N 2218
 
5.6%
C 1850
 
4.7%
G 1827
 
4.6%
V 1298
 
3.3%
Other values (15) 9426
23.7%
Other Punctuation
ValueCountFrequency (%)
; 118068
99.6%
' 509
 
0.4%
. 4
 
< 0.1%
? 4
 
< 0.1%
Space Separator
ValueCountFrequency (%)
149490
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 8820
100.0%
Final Punctuation
ValueCountFrequency (%)
128
100.0%
Control
ValueCountFrequency (%)
 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 1270555
82.1%
Common 277024
 
17.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 152163
12.0%
e 149527
11.8%
s 133715
10.5%
r 116349
9.2%
t 95576
 
7.5%
m 94542
 
7.4%
o 69125
 
5.4%
l 60718
 
4.8%
i 59895
 
4.7%
n 53167
 
4.2%
Other values (42) 285778
22.5%
Common
ValueCountFrequency (%)
149490
54.0%
; 118068
42.6%
- 8820
 
3.2%
' 509
 
0.2%
128
 
< 0.1%
. 4
 
< 0.1%
? 4
 
< 0.1%
 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1547439
> 99.9%
Punctuation 128
 
< 0.1%
None 12
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 152163
 
9.8%
e 149527
 
9.7%
149490
 
9.7%
s 133715
 
8.6%
; 118068
 
7.6%
r 116349
 
7.5%
t 95576
 
6.2%
m 94542
 
6.1%
o 69125
 
4.5%
l 60718
 
3.9%
Other values (47) 408166
26.4%
Punctuation
ValueCountFrequency (%)
128
100.0%
None
ValueCountFrequency (%)
ü 11
91.7%
 1
 
8.3%

nomenclaturalCode
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:51.397214image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length4
Median length4
Mean length4
Min length4

Characters and Unicode

Total characters75464
Distinct characters4
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowICZN
2nd rowICZN
3rd rowICZN
4th rowICZN
5th rowICZN
ValueCountFrequency (%)
iczn 18866
100.0%
2025-01-14T11:27:51.494556image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 75464
100.0%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 75464
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 75464
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
I 18866
25.0%
C 18866
25.0%
Z 18866
25.0%
N 18866
25.0%

taxonRemarks
Text

Constant 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size147.5 KiB
2025-01-14T11:27:51.543948image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Length

Max length41
Median length41
Mean length41
Min length41

Characters and Unicode

Total characters773506
Distinct characters18
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowAnimals and Plants: Vertebrates - Mammals
2nd rowAnimals and Plants: Vertebrates - Mammals
3rd rowAnimals and Plants: Vertebrates - Mammals
4th rowAnimals and Plants: Vertebrates - Mammals
5th rowAnimals and Plants: Vertebrates - Mammals
ValueCountFrequency (%)
animals 18866
16.7%
and 18866
16.7%
plants 18866
16.7%
vertebrates 18866
16.7%
18866
16.7%
mammals 18866
16.7%
2025-01-14T11:27:51.655452image/svg+xmlMatplotlib v3.9.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 113196
14.6%
94330
12.2%
s 75464
9.8%
e 56598
 
7.3%
m 56598
 
7.3%
l 56598
 
7.3%
n 56598
 
7.3%
t 56598
 
7.3%
r 37732
 
4.9%
A 18866
 
2.4%
Other values (8) 150928
19.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 565980
73.2%
Space Separator 94330
 
12.2%
Uppercase Letter 75464
 
9.8%
Dash Punctuation 18866
 
2.4%
Other Punctuation 18866
 
2.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 113196
20.0%
s 75464
13.3%
e 56598
10.0%
m 56598
10.0%
l 56598
10.0%
n 56598
10.0%
t 56598
10.0%
r 37732
 
6.7%
b 18866
 
3.3%
d 18866
 
3.3%
Uppercase Letter
ValueCountFrequency (%)
A 18866
25.0%
P 18866
25.0%
V 18866
25.0%
M 18866
25.0%
Space Separator
ValueCountFrequency (%)
94330
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 18866
100.0%
Other Punctuation
ValueCountFrequency (%)
: 18866
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 641444
82.9%
Common 132062
 
17.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 113196
17.6%
s 75464
11.8%
e 56598
8.8%
m 56598
8.8%
l 56598
8.8%
n 56598
8.8%
t 56598
8.8%
r 37732
 
5.9%
A 18866
 
2.9%
b 18866
 
2.9%
Other values (5) 94330
14.7%
Common
ValueCountFrequency (%)
94330
71.4%
- 18866
 
14.3%
: 18866
 
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 773506
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 113196
14.6%
94330
12.2%
s 75464
9.8%
e 56598
 
7.3%
m 56598
 
7.3%
l 56598
 
7.3%
n 56598
 
7.3%
t 56598
 
7.3%
r 37732
 
4.9%
A 18866
 
2.4%
Other values (8) 150928
19.5%